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A unique nucleic acid subfragment encoding lysine-insensitive aspartokinase (AK) is capable of causing a higher than nor- 
mal production i of threonine, when linked to a nucleic acid subfragment encoding lysine-insensitive dihydrodipicolinic acid syn- 
hase (DHDPS) higher than norma production of lysine. A further embodiment also includes a subfragment encoding antisense 
lysine ketoglutarate reductase. Nucleic aad sequences and chimeric genes are provided, as well as methods for increasing the Iv- 
sine or threonine content of seeds of plants. 6 ' 
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TITLE 

NUCLEIC ACID FRAGMENTS AND METHODS FOR 
INCREASING THE LYSINE AND THREONINE CONTENT 
OF THE SEEDS OF PLANTS 
5 TECHNICAL FIELD 

This invention relates to a unique synthetic 
nucleic acid subfragment encoding aspartokinase (AK) 
which is insensitive to inhibition by lysine. Methods 
for its use to produce increased levels of lysine or 
10 threonine in transformed organisms are also provided. 

BACKGROUN D OF THE INVENTION 

Human food and animal feed derived from many grains 
are deficient in some of the ten essential amino acids 
which are required in the animal diet. In corn ( Zea 

15 maxs. L.) f lysine is the most limiting amino acid for the 
dietary requirements of many animals. Soybean ( Glycine 
ma* L.) meal is used as an additive to corn based animal 
feeds primarily as a lysine supplement. Thus an 
increase in the lysine content of either corn or soybean 

20 would reduce or eliminate the need to supplement mixed 
grain feeds with lysine produced via fermentation of 
microbes . 

Plant breeders have long been interested in using 
naturally occuring variations to improve protein quality 

25 and quantity in crop plants. Maize lines containing 
higher than normal levels of lysine (70%) have been 
identified (Mertz, Science, 2A1, 279 (1964) and Nelson, 
Science, 150., 1469-70 (1965)). However, these lines 
which incorporate a mutant gene, opaque-2, exhibit poor 

30 agronomic qualities (increased susceptibility to disease 
and pests, 8-14% reduction in yield, low kernel weight, 
slower drying, lower dry milling yield of flaking grits, 
and increased storage problems) and thus are not 
commercially useful [Deutscher, D. Adv. Exp. Medirine 

35 and Biology, 1Q£, 281-300 (1978)]. Quality Protein 
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Maize (QPM) bred at CIMMYT using the opaque-2 and 
sugary-2 genes and associated modifiers has a hard 
endosperm and enriched levels of lysine and tryptophan 
in the kernels [Vasal, S. K. , et al. Proceedings of the 
5 3rd seed protein symposium , Gatersleben, August 31 - 

September 2 r 1983] . However, the gene pools represented 
in the QPM lines are tropical and subtropical. Quality 
Protein Maize is a genetically complex trait and the 
existing lines are not easily adapted to the dent 

10 germplasm in use in the United States, preventing the 
adoption of QPM by corn breeders. 

The amino acid content of seeds is determined 
primarily (90—99%) by the amino acid composition of the 
proteins in the seed and to a lesser extent (1-10%) by 

15 the free amino acid pools. The quantity of total 

protein in seeds varies from about 10% of the dry weight 
in cereals to 20-40% of the dry weight of legumes. Much 
of the protein-bound amino acids is contained in the 
seed storage proteins which are synthesized during seed 

20 development and which serve as a major nutrient reserve 
following germination* In many seeds the storage 
proteins account for 50% or more of the total protein. 

To improve the amino acid composition of seeds 
genetic engineering technology is being used to isolate, 

25 and express genes for storage proteins in transgenic 

plants. For example, a gene from Brazil nut for a seed 
2S albumin composed of 26% sulfur-containing amino acids 
has been isolated [Altenbach et al. (1987) Plant Mol. 
Biol. 8:239-250] and expressed in the seeds of 

30 transformed tobacco under the control of the regulatory 
sequences from a bean phaseolin storage protein gene. 
The accumulation of the sulfur-rich protein in the 
tobacco seeds resulted in an up to 30% increase in the 
level of methionine in the seeds [Altenbach et al. 

35 (1989) Plant Mol. Biol. 13:513-522]. However, no plant 
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seed storage proteins similarly enriched in lysine 
relative to average lysine content of plant proteins 
have been identified to date, preventing this approach 
from being used to increase lysine. 
5 Lysine, along with threonine, methionine and 

isoleucine, are amino acids derived from aspartate, and 
regulation of the biosynthesis of each member of this 
family is interconnected. Regulation of the metabolic 
flow in the pathway appears to be primarily via end 

10 products. The first step in the pathway is the 

phosphorylation of aspartate by the enzyme aspartokinase 
(AK) , and this enzyme has been found to be an important 
target for regulation in many organisms . However, 
detailed physiological studies on the flux of 4-carbon 

15 molecules through the aspartate pathway have been 

carried out in the model plant system Lemna paucicostata 
[Giovanelli et al. (1989) Plant Physiol. 90:1584-1599]. 
The authors state "These data now provide definitive 
evidence that the step catalyzed by aspartokinase is not 

20 normally an important site for regulation of the entry 
of 4-carbon units into the aspartate family of amino 
acids [in plants]." 

The aspartate family pathway is also regulated at 
the branch-point reactions. For lysine this is the 

25 condensation of aspartyl P-semialdehyde with pyruvate 
. catalyzed by dihydrodipicolinic acid synthase (DHDPS) , 
while for threonine and methionine the reduction of 
aspartyl 0- semi aldehyde by homoserine dehydrogenase 
(HDH) followed by the phosphorylation of homoserine by 

30 homoserine kinase (HK) are important points of control. 
Many attempts have been made to isolate lysine 
over-producing mutants of plants by selecting for 
resistance to the lysine analog S (2-aminoethyl) -cysteine 
(AEC) , either alone or in conjunction with lysine plus 

35 threonine resistance selections. No examples of 
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increases in seed lysine levels that would be 
commercially valuable have yet been reported, however. 

An alternative approach is to increase the 
production and accumulation of specific free amino acids 
5 such as lysine via genetic engineering technology. A 
number of genes for key regulatory enzymes of the 
aspartate family pathway have been isolated from various 
sources. The coli dapA gene encodes a DHDPS enzyme 
that is about 20-fold less sensitive to inhibition by 

10 lysine than wheat germ DHDPS. The £. coli dapA gene has 
been linked to plant gene expression sequences, 
introduced into tobacco cells via transformation and 
shown to cause a substantial increase in free lysine 
levels in leaves [Glassman et al. (1989) PCT Patent 

15 Appl. PCT/US89/01309] . However, no evidence that 

increased lysine was found in seeds was presented (even 
though seeds were available to test) . The £. dapA 
gene, linked to plant gene expression sequences, has 
also been introduced into tobacco cells by Galili et al. 

20 [(1991) Abstr. 422 from Third Int. Cong, of Int. Soc. 
Plant Mol. Biol.] and by Shaul et al. [(1992) Plant 
Jour. 2:203-209] and shown to result in an increase in 
free lysine levels in leaves. Again, no evidence for 
increased levels of free lysine in seeds was presented. 

25 These workers have recently reported on the introduction 
of an £. coli lysC gene that encodes a lysine- 
insensitive AK enzyme into tobacco cells via 
transformation [Galili et al. (1992) Eur. Patent Appl. 
91119328.2; Shaul et al. (1992) Plant Physiol. 

30 100:1157-1163]. Expression of the £. ££LL enzyme 

results in increases in the levels of free threonine in 
the leaves and seeds of transformed plants. Crosses of 
plants expressing coli DHDPS and AK resulted in 
progeny that accumulated more free lysine in leaves than 

35 the parental DHDPS plant, but less free threonine in 
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leaves than the parental AK plant . No evidence for 
increased levels of free lysine in seeds was presented. 

The limited understanding of the details of the 
regulation of the biosynthetic pathway in plants makes 
5 the application of genetic engineering technology, 
particularly to seeds, uncertain. There is little 
information available on the source of the aspartate- 
derived amino acids in seeds. It is not known, for 
example, whether they are synthesized in seeds, or 

10 transported to the seeds from leaves, or both, from most 
plants. In addition, free amino acids make up only a 
small fraction of the total amino acid content of seeds. 
Therefore, over-accumulation of free amino acids must be 
many-fold in order to significantly affect the total 

15 amino acid composition of the seeds. Furthermore, 

little is known about catabolism of free amino acids in 
seeds. Catabolism of free lysine has been observed in 
developing endosperm of corn and barley. The first step 
in the catabolism of lysine is believed to be catalyzed 

20 by lysine-ketoglutarate reductase [Brochetto-Braga et 
al. (1992) Plant Physiol. 98:1139-1147]. Whether such 
catabolic pathways are widespread in plants and whether 
they affect the level of accumulation of free amino 
acids is unknown. 

25 Before this application no method to increase the 

level of lysine, or any other amino acid, in seeds via 
genetic engineering was known. Thus, there is a need 
for genes, chimeric genes, and methods for expressing 
them in seeds so that an over-accumulation of free amino 

30 acids in seeds will result in an improvement in 
nutritional quality. 

SUMMARY OF THE INVENTION 

Applicant's invention is an isolated nucleic acid 
fragment comprising (a) a first nucleic acid subfragment 
35 encoding AK which is insensitive to inhibition by lysine 
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and (b) a second nucleic acid subfragment encoding 
dihydrodipicolinic acid synthase which is at least 
20-fold less sensitive to inhibition by lysine than 
plant DHDPS. A preferred embodiment further comprises a 
5 third nucleic acid subfragment encoding antisense lysine 
ketoglutarate reductase. In a further embodiment, the 
first and second subfragments are operably linked to a 
suitable plant chloroplast transit sequence and each 
subfragment is also linked to a suitable regulatory 
10 sequence to promote expression in plants. A preferred 
embodiment is either of the inventions described above 
in which: 

(a) the first nucleic acid subfragment 
comprises a nucleotide sequence substantially homologous 

15 to the sequence shown in SEQ ID NO:l encoding a lysine- 
insensitive variant of coli AKIII and further 
characterized in that at least one of the following 
conditions is met: 

(1) the amino acid at position 318 is an 
20 amino acid other than threonine, or 

(2) the amino acid at position 352 is an 
amino acid other than methionine; or 

(b) the second nucleic acid subfragment is 
derived from bacteria. 

25 A further embodiment is the invention in which: 

(a) the first nucleic acid subfragment is 
further characterized in that at least one of the 
following conditions is met: 

(1) the amino acid at position 318 is 
30 isoleucine, or 

(2) the amino acid at position 352 is 
isoleucine; or 

(b) the second nucleic acid subfragment is 
derived from bacteria, preferably either £. coli or 

35 Coryneba cterium glutamic^. 
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Another embodiment of the invention is the isolated 
nucleic acid fragment comprising at least one nucleotide 
sequence substantially homologous to the sequence shown 
in SEQ ID NO:l encoding E. coli AKIII, said nucleic acid 
5 fragment encoding a lysine-insensitive variant of 
E. coli AKIII. 

The subject matter also includes an isolated 
nucleic acid fragment comprising a nucleic acid 
subf ragment encoding lysine ketoglutarate reductase. 

10 Plants f seeds and microorganisms transformed with 

the nucleic acid fragments described are also 
embodiments of the invention. Further embodiments of 
the invention are methods for increasing the threonine 
or lysine content of seeds by transforming plants with 

15 the unique nucleic acid fragments described. 

BRIEF DESCRIPTION OF TRF. 
DRAWINGS AND SEQUENCE DESCRIPTIONS 
The invention can be more fully understood from the 
following detailed description and the accompanying 
20 drawings and the sequence descriptions which form a part 
of this application. 

Figure 1 shows a schematic representation of gene 
expression cassettes. 

Figure 2 shows a map of the binary plasmid vector 
25 pZS97K. 

Figure 3 shows a map of the binary plasmid vector 
PZS97. 

Figure 4 shows a map of the binary plasmid vector 
pZS199. 

30 Figure 5 shows a map of the plasmid vector pBT603. 

SEQ ID NO;l shows the nucleotide and amino acid 
sequence of the coding region of the wild type £. coli 
lysC gene, which encodes AKIII, described in Example 1. 
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SEQ ID NOS: 2 and 3 were used in Example 2 to 
create an Nco I site at the translation start codon of 
the lysC gene. 

SEQ ID NOS; 4 and 5 were used in Example 3 as PCR 
5 primers for the isolation of the Coryneba cterium dapA 
gene. 

SEQ ID NO: 6 shows the nucleotide and amino acid 
sequence of the coding region of the wild type 
Corynebacterium dapA gene, which encodes lysine- 
10 insensitive DHDPS, described in Example 3. 

SEQ ID NO: 7 was used in Example 4 to create an 
Nco I site at the translation start codon of the E. coli 
da pA gene. 

SEQ ID NOS: 8, 9, 10 and 11 were used in Example 6 
15 to create a chloroplast transit sequence and link the 
sequence to the £• coli lysC . coli lysC- M4, coli 
dapA and Corynebacteria dapA genes . 

SEQ ID NOS: 12 and 13 were used in Example 6 to 
create a Kpn I site immediately following the 
20 translation stop codon of the E.. coli dapA gene. 

SEQ ID NOS: 14 and 15 were used in Example 6 as PCR 
primers to create a chloroplast transit sequence and 
link the sequence to the Corynebacterium dapA gene. 
SEQ ID NO: 16 was used in Example 6 as a 
25 constitutive expression cassette for corn. 

SEQ ID NOS: 17-22 were used in Example 6 to create a 
corn chloroplast transit sequence and link the sequence 
to the E. coli lysC- M4 gene. 

SEQ ID NOS:23 and 24 were used in Example 6 as PCR 
30 primers to create a corn chloroplast transit sequence 
and link the sequence to the EL. coli dapA gene. 

The Sequence Descriptions contain the one letter 
code for nucleotide sequence characters and the three 
letter codes for amino acids as defined in conformity 
35 with the IUPAC-IYUB standards described in Nucleic Acids 
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Research 13:3021-3030(1985) and in the Biochemical 
Journal 219 (No. 2) :345-373 (1984) which are incorporated 
by reference herein. 

PETAILEP DESCRIPTION OF THE INVE NTION 
5 In order to increase the accumulation of specific 

free amino acids in the seeds of plants via genetic 
engineering, Applicant determined which enyzmes in this 
pathway controlled the pathway. In order to accomplish 
this, genes encoding enzymes in the pathway were 

10 isolated from bacteria. In some cases, mutations in the 
genes were obtained so that the enzyme encoded was made 
insensitive to end-product inhibition. Because the 
genes were not of plant origin, they had to be linked to 
intracellular localization sequences and suitable 

15 regulatory sequences for expression in plants. The 
genes, individually or in combinations, were then 
introduced into plants via transformation and assessed 
for their ability to elicit accumulation of the desired 
amino acid(s) in the desired plant organ (s) . 

20 The teachings below describe nucleic acid fragments 

and procedures useful for increasing the accumulation of 
lysine and threonine in the seeds of transformed plants 
as compared to levels of the amino acids of 
untransformed plants. Specifically, Applicant has 

25 provided a unique first nucleic acid fragment comprised 
of two nucleic acid subf ragments, one encoding AK which 
is insensitive to inhibition by lysine and the other 
encoding DHDPS which is at least 20-fold less sensitive 
to feedback inhibition by lysine than a typical plant 

30 DHDPS, e.g., wheat DHDPS. It is the combination of 
subfragments which Applicant found successfully 
increases the lysine accumulated in seeds of transformed 
plants as compared to untransformed host plants. To 
achieve a greater accumulation of lysine in seeds of 

35 transformed plants, Applicant has also provided a 
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nucleic acid fragment containing an antisense Lysine 
Ketoglutarate Reductase (LKR) chimeric gene. This 
fragment can be linked to the first fragment or combined 
via crossing of plants transformed with the first 
5 fragment with plants transformed with the nucleic acid 
fragment containing an antisense LKR chimeric gene. 
Applicant has also provided a nucleic acid fragment 
encoding AK which is insensitive to lysine inhibition. 
Plants transformed with this fragment accumulate 

10 increased threonine in their seeds compared to 
untransformed plants. 

In the context of this disclosure , a number of 
terms shall be utilized. As used herein, the term 
"nucleic acid" refers to a large molecule which can be 

15 single-stranded or double-stranded, composed of monomers 
(nucleotides) containing a sugar, phosphate and either a 
purine or pyrimidine. A "nucleic acid fragment" is a 
fraction of a given nucleic acid molecule. In higher 
plants, deoxyribonucleic acid (DNA) is the genetic 

20 material while ribonucleic acid (RNA) is involved in the 
transfer of the information in DNA into proteins. A 
"genome" is the entire body of genetic material 
contained in each cell of an organism. The term 
"nucleotide sequence" refers to a polymer of DNA or RNA 

25 which can be single- or double-stranded, optionally 

containing synthetic, non-natural or altered nucleotide 
bases capable of incorporation into DNA or RNA polymers. 

As used herein, the term "homologous to" refers to 
the complementarity between the nucleotide sequence of 

30 two nucleic acid molecules or between the amino acid 
sequences of two protein molecules. Estimates of such 
homology are provided by either DNA-DNA or DNA— RNA 
hybridization under conditions of stringency as is well 
understood by those skilled in the art [as described in 

35 Hames and Higgins (eds.) Nucleic Acid Hybridisation, IRL 
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Press, Oxford, U.K.]; or by the comparison of sequence 
similarity between two nucleic acids or proteins. 

As used herein, "substantially homologous" refers 
to nucleic acid molecules which require less stringent 
5 conditions of hybridization than those for homologous 
sequences, and also refers to coding DNA sequence which 
may involve base changes that do not cause a change in 
the encoded amino acid, or which involve base changes 
which may alter one or more amino acids, but not affect 

10 the functional properties of the protein encoded by the 
DNA sequence. Thus, the nucleic acid fragments 
described herein include molecules which comprise 
possible variations of the nucleotide bases derived from 
deletion, rearrangement, random or controlled 

15 mutagenesis of the nucleic acid fragment, and even 

occasional nucleotide sequencing errors so long as the 
DNA sequences are substantially homologous. 

"Gene" refers to a nucleic acid fragment that 
expresses a specific protein, including regulatory 

20 sequences preceding (5' non-coding) and following (3* 

non-coding) the coding region. "Native" gene refers to 
the gene as found in nature with its own regulatory 
sequences. "Chimeric" gene refers to a gene comprising 
heterogeneous regulatory and coding sequences. 

25 "Endogenous" gene refers to the native gene normally 
found in its natural location in the genome. A 
"foreign" gene refers to a gene not normally found in 
the host organism but that is introduced by gene 
transfer. 

30 "Coding sequence" refers to a DNA sequence that 

codes for a specific protein and excludes the non-coding 
sequences . 

"Initiation codon" and "termination codon" refer to 
a unit of three adjacent nucleotides in a coding 
35 sequence that specifies initiation and chain 
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termination, respectively, of protein synthesis (mRNA 
translation) . "Open reading frame" refers to the amino 
acid sequence encoded between translation initiation and 
termination codons of a coding sequence. 
5 "RNA transcript" refers to the product resulting 

from RNA polymerase-catalyzed transcription of a DNA 
sequence. When the RNA transcript is a perfect 
complementary copy of the DNA sequence, it is referred 
to as the primary transcript or it may be a RNA sequence 

10 derived from posttranscriptional processing of the 

primary transcript. "Messenger RNA (mRNA) refers to RNA 
that can be translated into protein by the cell. n cDNA n 
refers to a double-stranded DNA that is complementary to 
and derived from mRNA. "Sense" RNA refers to RNA 

15 transcript that includes the mRNA. "Antisense RNA" 

refers to a RNA transcript that is complementary to all 
or part of a target primary transcript or mRNA and that 
blocks the expression of a target gene by interfering 
with the processing, transport and/or translation of its 

20 primary transcript or mRNA. The complementarity of an 
antisense RNA may be with any part of the specific gene 
transcript , i.e., at the 5' non-coding sequence, 3' non- 
coding sequence, introns, or the coding sequence. In 
addition, as used herein, antisense RNA may contain 

25 regions of ribozyme sequences that increase the efficacy 
of antisense RNA to block gene expression. "Ribozyme" 
refers to a catalytic RNA and includes sequence-specific 
endoribonucleases . 

As used herein, suitable "regulatory sequences" 

30 refer to nucleotide sequences located upstream (5M, 
within, and/or downstream (3*) to a coding sequence, 
which control the transcription and/or expression of the 
coding sequences, potentially in conjunction with the 
protein biosynthetic apparatus of the cell. These 

35 regulatory sequences include promoters, translation 
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leader sequences, transcription termination sequences, 
and polyadenylation sequences. 

"Promoter" refers to a DNA sequence in a gene, 
usually upstream (5 f ) to its coding sequence, which 
5 controls the expression of the coding sequence by 

providing the recognition for RNA polymerase and other 
factors required for proper transcription. A promoter 
may also contain DNA sequences that are involved in the 
binding of protein factors which control the 

10 effectiveness of transcription initiation in response to 
physiological or developmental conditions. It may also 
contain enhancer elements. 

An "enhancer" is a DNA sequence which can stimulate 
promoter activity. It may be an innate element of the 

15 promoter or a heterologous element inserted to enhance 
the level and/or tissue-specificity of a promoter. 
"Constitutive promoters" refers to those that direct 
gene expression in all tissues and at all times. 
"Organ-specific" or "development-specific" promoters as 

20 referred to herein are those that direct gene expression 
almost exclusively in specific organs, such as leaves or 
seeds, or at specific development stages in an organ, 
such as in early or late embryogenesis, respectively. 
The term "operably linked" refers to nucleic acid 

25 sequences on a single nucleic acid molecule which are 
associated so that the function of one is affected by 
the other. For example, a promoter is operably linked 
with a structure gene (i.e., a gene encoding 
aspartokinase that is lysine-insensitive as given 

30 herein) when it is capable of affecting the expression 
of that structural gene (i.e., that the structural gene 
is under the transcriptional control of the promoter) . 

The term "expression", as used herein, is intended 
to mean the production of the protein product encoded by 

35 a gene. More particularly, "expression" refers to the 
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transcription and stable accumulation of the sense 
(mRNA) or tha antisense RNA derived from the nucleic 
acid fragment (s) of the invention that, in conjuction 
with the protein apparatus of the cell, results in 
5 altered levels of protein product. "Antisense 

inhibition" refers to the production of antisense RNA 
transcripts capable of preventing the expression of the 
target protein. "Overexpression" refers to the 
production of a gene product in transgenic organisms 

10 that exceeds levels of production in normal or non- 
transformed organisms. "Cosuppression" refers to the 
expression of a foreign gene which has substantial 
homology to an endogenous gene resulting in the 
suppression of expression of both the foreign and the 

15 endogenous gene. "Altered levels" refers to the 

production of gene product (s) in transgenic organisms in 
amounts or proportions that differ from that of normal 
or non-transformed organisms. 

The n 3 T non-coding sequences" refers to the DNA 

20 sequence portion of a gene that contains a 

polyadenylation signal and any other regulatory signal 
capable of affecting mRNA processing or gene expression. 
The polyadenylation signal is usually characterized by 
affecting the addition of polyadenylic acid tracts to 

25 the 3* end of the mRNA precursor + 

The "translation leader sequence" refers to that 
DNA sequence portion of a gene between the promoter and 
coding sequence that is transcribed into RNA and is 
present in the fully processed mRNA upstream (5 1 ) of the 

30 translation start codon. The translation leader 

sequence may affect processing of the primary transcript 
to mRNA, mRNA stability or translation efficiency. 

"Mature" protein refers to a post-trans lationally 
processed polypeptide without its targeting signal. 

35 "Precursor" protein refers to the primary product of 
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translation of mRNA, A "chloroplast targeting signal" 
is an amino acid sequence which is translated in 
conjunction with a protein and directs it to the 
chloroplast. "Chloroplast transit sequence" refers to a 
5 nucleotide sequence that encodes a chloroplast targeting 
signal. 

"Transformation" herein refers to the transfer of a 
foreign gene into the genome of a host organism and its 
genetically stable inheritance. Examples of methods of 
10 plant transformation include Aarobacterium -mediated 

transformation and particle-accelerated or "gene gun" 
transformation technology. 

Isolation of AK Genes 
The IL. coli lysC gene has been cloned, restriction 

15 endonuclease mapped and sequenced previously [Cassan et 
al. (1986) J. Biol. Chem. 261:1052-1057]. For the 
present invention the lysC gene was obtained on a 
bacteriphage lambda clone from an ordered library of 
3400 overlapping segments of cloned £. coli DNA 

20 constructed by Kohara, Akiyama and Isono [Kohara et al. 
(1987) Cell 50:595-508] . The £. soli lys£ gene encodes 
the enzyme AKIII, which is sensitive to lysine 
inhibition. Mutations were obtained in the lysC gene 
that cause the AKIII enzyme to be resistant to lysine. 

25 To determine the molecular basis for lysine- 

resistance, the sequence of the wild type lysC gene and 
three mutant genes were determined. The sequence of the 
cloned wild type lysC gene, indicated in SEQ ID NO:!:, 
differed from the published lysC sequence in the coding 

30 region at 5 positions. 

The sequences of the three mutant lysC genes that 
encoded lysine-insensitive aspartokinase each differed 
from the wild type sequnce by a single nucleotide, 
resulting in a single amino acid substitution in the 

35 protein. One mutant (M2) had an A substituted for a G 
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at nucleotide 954 of SEQ ID NO:l: resulting in an 
isoleucine for methionine substitution in the amino acid 
sequence of AKIII and two mutants (M3 and M4) had 
identical T for C substitutions at nucleotide 1055 of 
5 SEQ ID NO:l resulting in an isoleucine for threonine 
substitution . 

Other mutations could be generated, either in vivo 
as described in Example 1 or in vitro by site-directed 
mutagenesis by methods known to those skilled in the 

10 art, that result in amino acid substitutions for the 

methionine or threonine residue present in the wild type 
AKIII at these positions. Such mutations would be 
expected to result in a lysine— insensitive enzyme. 
Furthermore, the method described in Example 1 could be 

15 used to easily isolate and characterize as many 

additional mutant lysC genes encoding lysine insensitive 
AKIII as desired. 

A number of other AK genes have been isolated and 
sequenced. These include the thrA gene of £. coli 

20 (Katinka et al. (1980) Proc. Natl. Acad. Sci. USA 

77:5730-5733] r the metL gene of E.. soli (Zakin et al. 
(1983) J. Biol. Chem. 258:3028-3031], the HQMi gene of 
£. cerevisiae [Rafalski et al. (1988) J. Biol. Chem. 
263:2146-2151] . The thrA gene of £. coli encodes a 

25 bifunctional protein, AKI-HDHI. The AK activity of this 
enzyme is insensitive to lysine, but sensitive to 
threonine. The metL gene of ££li also encodes a 
bifunctional protein, AKII-HDHII, and the AK activity of 
this enzyme is also insensitive to lysine. The HOM3 

30 gene of yeast encodes an AK which is insensitive to 
lysine, but sensitive to threonine. 

In addition to these genes, several plant genes 
encoding lysine- insensitive AK are known. In barley 
lysine plus threonine- resistant mutants bearing 

35 mutations in two unlinked genes that result in two 
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different lysine-insensitive AK isoenzymes have been 
described [Bright et al. (1982) Nature 299:278-279, 
Rognes et al. (1983) Planta 157:32-38, Arruda et al. 
(1984) Plant Phsiol. 76:442-446]. In com, a lysine 
5 plus threonine-resistant cell line had AK activity that 
was less sensitive to lysine inhibition than its parent 
line [Hibberd et al. (1980) Planta 148:183-187]. A 
subsequently isolated lysine plus threonine-resistant 
corn mutant is altered at a different genetic locus and 
10 also produces lysine-insensitive AK [Diedrick et al. 
(1990) Theor. Appl. Genet, 79:209-215, Dotson et al. 

(1990) Planta 182:546-552]. In tobacco there are two AK 
enzymes in leaves, one lysine-sensitive and one 
threonine-sensitive. A lysine plus threonine-resistant 

15 tobacco mutant that expressed completely lysine- 
insensitive AK has been described [Frankard et al. 

(1991) Theor. Appl. Genet. 82:273-282]. These plant 
mutants could serve as sources of genes encoding lysine- 
insensitive AK and used, based on the teachings herein, 

20 to increase the accumulation of lysine and threonine in 
the seeds of transformed plants. 

A partial amino acid sequence of AK from carrot has 
been reported [Wilson et al. (1991) Plant Physiol. 
97:1323:1328]. Using this information a set of 

25 degenerate DNA oligonucleotides could be designed, 

synthesized and used as hybridization probes to permit 
the isolation of the carrot AK gene. Recently the 
carrot AK gene has been isolated and its nucleotide 
sequence has been determined [Matthews et al. (1991) 

30 U.S. S.N. 07/746,705]. This gene can be used as a 

heterologous hybridization probe to isolate the genes 
encoding lysine-insensitive AK described above. 
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Vfi gh level expression of wild type and 
mutant lysC genes in E. coli 
To achieve high level expression of the lysC genes 
in £. coli , a bacterial expression vector which employs 
5 the bacteriphage T7 RNA polymerase/T7 promoter system 
[Rosenberg et al. (1987) Gene 56:125-135] was used. The 
expression vector and lysC gene were modified as 
described in Example 2 to construct a lysC expression 
vector. For expression of the mutant lysC genes (M2, M3 

10 and M4), the wild type lysC gene was replaced with the 
mutant genes as described in Example 2. 

For high level expression, each of the expression 
vectors was transformed into E. qq1± strain B121(DE3) 
[Studier et al. (1986) J. Mol. Biol. 189:113-130}. 

15 Cultures were grown, expression was induced, cells were 
collected, and extracts were prepared as described in 
Example 2. Supernatant and pellet fractions of extracts 
from uninduced and induces cultures were analyzed by SDS 
polyacrylamide gel electrophoresis and by AK enzyme 

20 assays as described in Example 2. The major protein 
visible by Coomassie blue staining in the supernatant 
and pellet fractions of induced cultures was AKIII. 
About 80% of the AKIII protein was in the supernatant 
and AKIII represented 10-20% of the total £. coli 

25 protein in the extract. 

Approximately 80% of the AKIII enzyme activity was 
in the supernatant fraction. The specific activity of 
wild type and mutant crude extracts was 5-7 jimoles 
product per minute per milligram total protein. Wild 

30 type AKIII was sensitive to the presence of L-lysine in 
the assay. Fifty percent inhibition was found at a 
concentration of about 0.4mM and 90 percent inhibition 
at about O.lmM. In contrast, mutants AKIII-M2, M3 and 
M4 were not inhibited at all by 15mM L-lysine. 
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Wild type AKIII protein was purified from the 
supernatant of an induced culture as described in 
Example 2 . Rabbit antibodies were raised against the 
purified AKIII protein. 
5 Many other microbial expression vectors have been 

described in the literature. One skilled in the art 
could make use of any of these to construct lysC 
expression vectors. These lysp expression vectors could 
then be introduced into appropriate microorganisms via 

10 transformation to provide a system for high level 
expression of AKIII. 

Isolation of PHD PS genes 
The £. c oJLi dapA gene (ecodapA) has been cloned, 
restriction endonuclease mapped and sequenced previously 

15 [Richaud et al. (1986) J. Bacteriol. 166:297-300]. For 
the present invention the da pA gene was obtained on a 
bacteriophage lambda clone from an ordered library of 
3400 overlapping segments of cloned £. coli . DNA 
constructed by Kohara, Akiyama and Isono [Kohara et al. 

20 (1987) Cell 50:595-508]. The ecodaE& gene encodes a 
DHDPS enzyme that is sensitive to lysine inhibition. 
However, it is about 20-fold less sensitive to 
inhibition by lysine than a typical plant DHDPS, e.g., 
wheat germ DHDPS. 

25 The Corynebacterium dapA gene (cordapA) was 

isolated from genomic DNA from ATCC strain 13032 using 
polymerase chain reaction (PCR) . The nucleotide 
sequence of the Corynebacterium dapA gene has been 
published [Bonnassie et al. (1990) Nucleic Acids Res. 

30 18:6421]. From the sequence it was possible to design 
oligonucleotide primers for polymerase chain reaction 
(PCR) that would allow amplification of a DNA fragment 
containing the gene, and at the same time add unique 
restriction endonuclease sites at the start codon and 

35 just past the stop codon of the gene to facilitate 
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further constructions involving the gene. The details 
of the isolation of the cor dapA gene are presented in 
Example 3. The co rdapA gene encodes a DHDPS enzyme that 
is insensitive to lysine inhibit on. 
5 In addition to introducing a restriction 

endonuclease site at the translation start codon, the 
PCR primers also changed the second codon of the cordaEE 
gene from AGC coding for serine to GCT coding for 
alanine. Several cloned DNA fragments that expressed 

10 active, lysine-insensitive DHDPS were isolated, 

indicating that the second codon amino acid substitution 
did not affect enzyme activity. 

The PCR-generated Corynebacterium dapA gene was 
subcloned into the phagemid vactor pGEM-9zf (-) from 

15 Promega, and single-stranded DNA was generated and 

sequenced (SEQ ID NO: 6) . Aside from the differenced in 
the second codon already mentioned, the sequence matched 
the published sequence except at two positions, 
nucleotides 798 and 799. In the published sequence 

20 these are TC, while in the gene shown in SEQ ID NO: 6 
they are CT. This change results in an amino acid 
substitution of leucine for serine. The reason for this 
difference is not known. The difference has no apparent 
effect on DHDPS enzyme activity. 

25 The isolation of other genes encoding DHDPS has 

been described in the literature. A cDNA encoding DHDPS 
from wheat [Kaneko et al. (1990) J. Biol. Chem. 
265:17451-17455], and a cDNA encoding DHDPS from corn 
[Frisch et al. (1991) Mol. Gen. Genet. 228:287-293] are 

30 two examples. These genes encode wild type lysine- 
sensitive DHDPS enzymes. However, Negrutui et al. 
[(1984) Theor. Appl. Genet. 68:11-20], obtained two AEC- 
resistant tobacco mutants in which DHDPS activity was 
less sensitive to lysine inhibition than the wild type 

35 enzyme. These genes could be isolated using the methods 
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already described for isolating the wheat or corn genes 
or, alternatively, by using the wheat or corn genes as 
heterologous hybridization probes. 

Still other genes encoding DHDPS could be isolated 
by one skilled in the art by using either the eco dapA 
gene, the cordapA gene, or either of the plant DHDPS 
genes as DNA hybridization probes. Alternatively, other 
genes encoding DHDPS could be isolated by functional 
complementation of an E, coli dapA mutant, as was done 
to isolate the cordapA gene [Yeh et al. (1988) Mol. Gen. 
Genet. 212:105-111] and the corn DHDPS gene. 

High level expression of ecodapA and 
cordapA genes in L coli 

To achieve high level expression of the ec odapA and 
cordapA genes in £. a bacterial expression vector 

which employs the bacteriophage T7 RNA polymerase/T7 
promoter system [Rosenberg et al. (1987) Gene 
56:127-135] was used. The vector and dapA genes were 
modified as described below to construct eco dapA and 
cor^ap^ expression vectors. 

For high level expression each of the expression 
vectors was transformed into E.. colj , strain BL21(DE3) 
[Studier et al. (1986) J. Mol. Biol. 189:113-130]. 
Cultures were grown, expression was induced, cells were 
collected, and extracts were prepared as described in 
Example 4 . Supernatant and pellet fractions of extracts 
from uninduced and induced cultures were analyzed by SDS 
polyacrylamide gel electrophoresis and by DHDPS enzyme 
assays as described in Example 4 . The major protein 
visible by Coomassie blue staining in the supernatant 
and pellet fractions of both induced cultures had a 
molecular weight of 32-34 kd, the expected size for 
DHDPS. Even in the uninduced cultures this protein was 
the most prominent protein produced. 



WO 93/19190 



22 



PCT/US93/02480 



In the induced culture with the ecoil2E& gene about 
80% of the DHDPS protein was in the supernatant and 
DHDPS represented 10-20% of the total protein in the 
extract. In the induced culture with the cord&Ek gene 
5 more than 50% of the DHDPS protein was in the pellet 
fraction. The pellet fractions in both cases were 
90-95% pure DHDPS, with no other single protein present 
in significant amounts. Thus, these fractions were pure 
enough for use in the generation of rabbit antibodies. 

10 The specific activity of £. coli DHDPS in the 

supernatant fraction of induced extracts was about 50 
OD540 units per milligram protein. £. qqIx DHDPS was 
sensitive to the presence of L-lysine in the assay. 
Fifty percent inhibition was found at a concentration of 

15 about O.SmM. For Corynebacterium DHDPS, enzyme activity 
was measured in the supernatant fraction of uninduced 
extracts, rather than induced extracts. Enzyme activity 
was about 4 OD530 units per minute per milligram protein. 
In contrast to £. coli DHDPS, Corynebacterium DHDPS was 

20 not inhibited at all by L-lysine, even at a 
concentration of 70mM. 

Many other microbial expression vectors have been 
described in the literature. One skilled in the art 
could make use of any of these to construct ecodaea or 

25 co rdapA expression vectors. These expression vectors 

could then be introduced into appropriate microorganisms 
via transformation to provide a system for high level 
expression of DHDPS. 

Excretion of am ino acids by E. coli expressing 

30 higrh levels of DHDPS and/or AKIII 

The £. coli expression cassettes were inserted into 
expression vectors and then transformed into £. 
strain BL21 (DE3) [Studier et al. (1986) J. Mol. Biol. 
189:113-130] to induce £. qqIx to produce and excrete 
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amino acids. Details of the procedures used and results 
are presented in Example 5. 

Other microbial expression vectors known to those 
skilled in the art could be used to make and combine 
expression cassettes for the lysC and dapA genes. These 
expression vectors could then be introduced into 
appropriate microorganisms via transformation to provide 
alternative systems for production and excretion of 
lysine, theronine and methionine. 

Construction of Chimeric Genes for Expression of 
lvsC and daoA Coding Regions in Plants 

The expression of foreign genes in plants is well- 
established [De Blaere et al. (1987) Meth. Enzymol. 
143:277-291]. Proper level of expression of lysC and 
tiapA mRNAs may require the use of different chimeric 
genes utilizing different promoters. Such chimeric 
genes can be transferred into host plants either 
together in a single expression vector or sequentially 
using more than one vector. A preferred class of 
heterologous hosts for the expression of the coding 
sequence of lysC and dapA genes are eukaryotic hosts, 
particularly the cells of higher plants. Particularly 
preferred among the higher plants and the seeds derived 
from them are soybean, rapeseed ( Brassica napus P 
B. sampestris) , sunflower (Helianthus annus) r cotton 
(Gosgypiym hixSUtum) , corn, tobacco ( Nicotiana Tuba cum ) , 
alfalfa (Medicego sativa) , wheat ( Triticum sp) , barley 
(Bprcteum VUlgare) , oats ( Avena sativa P L) , sorghum 
(Sorghum bicolor) , rice f Qryza satim) , and forage 
grasses. Expression in plants will use regulatory 
sequences functional in such plants. 

The origin of promoter chosen to drive the 
expression of the coding sequence is not critical as 
long as it has sufficient transcriptional activity to 
accomplish the invention by expressing translatable mRNA 
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for lysC or dapA genes in the desired host tissue. 
Preferred promoters for expression in all plant organs , 
and especially for expression in leaves include those 
directing the 19S and 35S transcripts in Cauliflower 
5 mosaic virus [Odell et al. (1985) Nature 313:810-812; 
Hull et al. (1987) Virology 86:482-493], small subunit 
of ribulose 1, 5-bisphosphate carboxylase [Morelli 
et al.(1985) Nature 315:200; Broglie et al. (1984) 
Science 224:838; Hererra-Estrella et al. (1984) Nature 
10 310:115; Coruzzi et al.(1984) EMBO J* 3:1671; Faciotti 
et al. (1985) Bio/Technology 3:241], maize zein protein 
[Matzke et al. (1984) EMBO J. 3:1525], and chlorophyll 
a/b binding protein [Lampa et al. (1986) Nature 
316:750-752] . 

15 Depending upon the application, it may be desirable 

to select promoters that are specific for expression in 
one or more organs of the plant. Examples include the 
light-inducible promoters of the small subunit of 
ribulose 1, 5-bisphosphate carboxylase, if the expression 

20 is desired in photosynthetic organs, or promoters active 
specifically in seeds. 

Preferred promoters are those that allow expression 
of the protein specifically in seeds. This may be 
especially useful, since seeds are the primary source of 

25 vegetable amino acids and also since seed-specific 

expression will avoid any potential deleterious effect 
in non-seed organs. Examples of seed-specific promoters 
include, but are not limited to, the promoters of seed 
storage proteins. The seed storage proteins are 

30 strictly regulated, being expressed almost exclusively 
in seeds in a highly organ-specific and stage-specific 
manner [Higgins et al. (1984) Ann. Rev. Plant Physiol. 
35:191-221; Goldberg et al. (1989) Cell 56:149-160; 
Thompson et al. (1989) BioEssays 10:108-113] . Moreover, 
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different seed storage proteins may be expressed at 
different stages of seed development. 

There are currently numerous examples for seed- 

* specific expression of seed storage protein genes in 

5 transgenic dicotyledonous plants. These include genes 

* from dicotyledonous plants for bean p-phaseolin 

[Sengupta-Goplalan et al. (1985) Proc. Natl. Acad. Sci. 
USA 82:3320-3324; Hoffman et al. (1988) Plant Mol. Biol. 
11:717-729], bean lectin [Voelker et al. (1987) EMBO J. 
10 6: 3571-3577], soybean lectin [Okamuro et al. (1986) 

Proc. Natl. Acad. Sci. USA 83:8240-8244], soybean kunitz 
trypsin inhibitor [Perez-Grau et al. (1989) Plant Cell 
1:095-1109], soybean p-conglycinin [Beachy et al. (1985) 

EMBO J. 4:3047-3053; Barker et al . (1988) Proc. Natl. 

15 Acad. Sci. USA 85:458-462; Chen et al. (1988) EMBO J. 
7:297-302; Chen et al. (1989) Dev. Genet. 10:112-122; 
Naito et al. (1988) Plant Mol. Biol. 11:109-123], pea 
vicilin [Higgins et al. (1988) Plant Mol. Biol. 
11:683-695], pea convicilin [Newbigin et al. (1990) 

20 Planta 180:461], pea legumin [Shirsat et al. (1989) Mol. 
Gen. Genetics 215:326]; rapeseed napin [Radke et al. 
(1988) Theor. Appl. Genet. 75:685-694] as well as genes 
from monocotyledonous plants such as for maize 15 kD 
zein [Hoffman et al. (1987) EMBO J. 6:3213-3221; 

25 Schernthaner et al. (1988) EMBO J. 7:1249-1253; 

Williamson et al . (1988) Plant Physiol. 88:1002-1007], 
barley JJ-hordein [Marris et al. (1988) Plant Mol. Biol. 

10:359-366] and wheat glutenin [Colot et al. (1987) EMBO 
J. 6:3559-3564]. Moreover, promoters of seed-specific 

^ 30 genes, operably linked to heterologous coding sequences 

in chimeric gene constructs, also maintain their 

s temporal and spatial expression pattern in transgenic 

plants. Such examples include Arabidopsis thaliana 2S 
seed storage protein gene promoter to express enkephalin 
35 peptides in Arabidopsis and £. na pus seeds 
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[Vandekerckhove et al. (1989) Bio/Technology 7:929-932], 
bean lectin and bean p-phaseolin promoters to express 

luciferase [Riggs et al. (1989) Plant Sci. 63:47-57] , 
and wheat glutenin promoters to express chloramphenicol 
5 acetyl transferase [Colot et al. (1987) EMBO J. 
6:3559-3564] . 

Of particular use in the expression of the nucleic 
acid fragment of the invention will be the heterologous 
promoters from several extensively-characterized soybean 

10 seed storage protein genes such as those for the Kunitz 
trypsin inhibitor [Jofuku et al. (1989) Plant Cell 
1:1079-1093; Perez-Grau et al. (1989) Plant Cell 
1:1095-1109], glycinin [Nielson et al. (1989) Plant Cell 
1:313-328], p-conglycinin [Harada et al. (1989) Plant 

15 Cell 1:415-425] . Promoters of genes for a r - and 

P-subunits of soybean p-conglycinin storage protein will 
be particularly useful in expressing the lysC and dap& 
mRNAs in the cotyledons at mid- to late-stages of 
soybean seed development [Beachy et al. (1985) EMBO J. 

20 4:3047-3053; Barker et al. (1988) Proc. Natl. Acad. Sci. 
USA 85:458-462; Chen et al. (1988) EMBO J. 7:297-302; 
Chen et al. (1989) Dev. Genet. 10:112-122; Naito et al. 
(1988) Plant Mol. Biol. 11:109-123] in transgenic 
plants, since: a) there is very little position effect 

25 on their expression in transgenic seeds, and b) the two 
promoters show different temporal regulation: the 
promoter for the a f -subunit gene is expressed a few days 
before that for the P-subunit gene. 

Also of particular use in the expression of the 

30 nucleic acid fragments of the invention will be the 
heterologous promoters from several extensively 
characterized corn seed storage protein genes such as 
endosperm-specific promoters from the 10 kD zein 
[Kirihara et al. (1988) Gene 71:359-370], the 27 kD zein 

35 [Prat et al. (1987) Gene 52:51-49; Gallardo et al. 
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(1988) Plant Sci. 54:211-281], and the 19 kD zein [Marks 
et al. (1985) J. Biol. Chem. 260:16451-16459], The 
relative transcriptional activities of these promoters 
in corn have been reported [Kodrzyck et al. (1989) Plant 
5 Cell 1:105-114] providing a basis for choosing a 

promoter for use in chimeric gene constructs for corn. 
For expression in corn embryos, the strong embryo- 
specific promoter from the GLB1 gene [Kriz (1989) 
Biochemical Genetics 27:239-251, Wallace et al. (1991) 

10 Plant Physiol. 95:973-975] can be used. 

It is envisioned that the introduction of enhancers 
or enhancer-like elements into other promoter constructs 
will also provide increased levels of primary 
transcription for lysC and dapA genes to accomplish the 

15 invention. These would include viral enhancers such as 
that found in the 35S promoter [Odell et al. (1988) 
Plant Mol. Biol. 10:263-272], enhancers from the opine 
genes [Fromm et al. (1989) Plant Cell 1:977-984], or 
enhancers from any other source that result in increased 

20 transcription when placed into a promoter operably 

linked to the nucleic acid fragment of the invention. 

Of particular importance is the DNA sequence 
element isolated from the gene for the <x f -subunit of 
fj-conglycinin that can confer 40-fold seed-specific 

25 enhancement to a constitutive promoter [Chen et al. 

(1988) EMBO J. 7:297-302; Chen et al. (1989) Dev. Genet. 
10:112-122]. One skilled in the art can readily isolate 
this element and insert it within the promoter region of 
any gene in order to obtain seed-specific enhanced 

30 expression with the promoter in transgenic plants. 

Insertion of such an element in any seed-specific gene 
that is expressed at different times than the 
ji-conglycinin gene will result in expression in 

transgenic plants for a longer period during seed 
3 5 de ve lopment . 
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Any 3 f non-coding region capable of providing a 
polyadenylation signal and other regulatory sequences 
that may be required for the proper expression of the 
lysC or dapA coding regions can be used to accomplish 
5 the invention. This would include the 3' end from any 
storage protein such as the 3 r end of the bean phaseolin 
gene, the 3 X end of the soybean p-conglycinin gene, the 

3' end from viral genes such as the 3' end of the 35S or 
the 19S cauliflower mosaic virus transcripts, the 3 1 end 

10 from the opine synthesis genes, the 3 1 ends of ribulose 
1, 5-bisphosphate carboxylase or chlorophyll a/b binding 
protein, or 3' end sequences from any source such that 
the sequence employed provides the necessary regulatory 
information within its nucleic acid sequence to result 

15 in the proper expression of the promoter /lyaC coding 
region combination to which it is operably linked. 
There are numerous examples in the art that teach the 
usefulness of different 3' non-coding regions [for 
example, see Ingelbrecht et al. (1989) Plant Cell 

20 1:671-680] . 

DNA sequences coding for intracellular localization 
sequences may be added to the lysC and dapA coding 
sequence if required for the proper expression of the 
proteins to accomplish the invention. Plant amino acid 

25 biosynthetic enzymes are known to be localized in the 
chloroplasts and therefore are synthesized with a 
chloroplast targeting signal. Bacterial proteins such 
as DHDPS and AKIII have no such signal. A chloroplast 
transit sequence could, therefore, be fused to the dapA 

30 and lysC coding sequences. Preferred chloroplast 

transit sequences are those of the small subunit of 
ribulose 1, 5-bisphosphate carboxylase, e.g. from soybean 
[Berry-Lowe et al. (1982) J. Mol. Appl. Genet. 
1:483-498] for use in dicotyledonous plants and from 
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corn [Lebrun et al. (1987) Nucleic Acids Res. 15:4360] 
for use in monocotyledonous plants. 

Introduction of lysC and dap A 

Chimeric Genes into Plants 
5 Various methods of introducing a DNA sequence 

(i.e., of transforming) into eukaryotic cells of higher 
plants are available to those skilled in the art (see 
EPO publications 0 295 959 A2 and 0 138 341 Al) . Such 
methods include those based on transformation vectors 

10 based on the Ti and Ri plasmids of Aarobacterium spp. 
It is particularly preferred to use the binary type of 
these vectors. Ti-derived vectors transform a wide 
variety of higher plants, including monocotyledonous and 
dicotyledonous plants , such as soybean, cotton and rape 

15 [Pacciotti et al. (1985) Bio/Technology 3:241; Byrne 

et al. (1987) Plant Cell, Tissue and Organ Culture 8:3; 
Sukhapinda et al . (1987) Plant Mol. Biol. 8:209-216; 
Lorz et al. (1985) Mol. Gen. Genet. 199:178; Potrykus 
(1985) Mol. Gen. Genet. 199:183]. 

20 For introduction into plants the chimeric genes of 

the invention can be inserted into binary vectors as 
described in Examples 7-12 and 14-16. The vectors are 
part of a binary Ti plasmid vector system [Bevan, (1984) 
Nucl. Acids. Res. 12:8711-8720] of Aarobacterium 

25 tvimefaciens. 

Other transformation methods are available to those 
skilled in the art, such as direct uptake of foreign DNA 
constructs [see EPO publication 0 295 959 A2], 
techniques of electroporation [see Fromm et al. (1986) 

30 Nature (London) 319:791] or high-velocity ballistic 

bombardment with metal particles coated with the nucleic 
acid constructs [see Kline et al. (1987) Nature (London) 
327:70, and see U.S. Pat. No. 4,945,050], Once 
transformed, the cells can be regenerated by those 

35 skilled in the art. 
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Of particular relevance are the recently described 
methods to transform foreign genes into commercially 
important crops, such as rapeseed [see De Block et al» 
(1989) Plant Physiol. 91: 694-701] , sunflower [Everett 
5 et al. (1987) Bio/Technology 5 : 1201] , soybean [McCabe 
et al. (1988) Bio/Technology 6:923; Hinchee et al. 
(1988) Bio/Technology 6:915; Chee et al. (1989) Plant 
Physiol. 91:1212-1218; Christou et al. (1989) Proc. 
Natl. Acad. Sci USA 86:7500-7504; EPO Publication 
10 0 301 749 A2], and corn [Gordon-Karam et al. (1990) Plant 
Cell 2:603-618; Fromm et al. (1990) Biotechnology 
8:833-839] . 

For introduction into plants by high- velocity 
ballistic bombardment , the chimeric genes of the 

15 invention can be inserted into suitable vectors as 
described in Example 6. Transformed plants can be 
obtained as described in Examples 17-19. 

Expression of lysC and dapft Chimeric Genes in Plants 
To assay for expression of the chimeric genes in 

20 leaves or seeds of the transformed plants , the AKIII or 
DHDPS proteins can be detected and quantitated 
enzymatically and/or immunologically by methods known to 
those skilled in the art. In this way lines producing 
high levels of expressed protein can be easily 

25 identified. 

In order to measure the free amino acid composition 
of the leaves, free amino acids can be extracted by 
various methods including those as described in 
Example 7. To measure the free or total amino acid 

30 composition of seeds, extracts can be prepared by 
various methods including those as described in 
Example 8. 

There was no significant effect of expression of 
AKIII or AKIII-M4 (with a chloroplast targeting signal) 
35 on the free lysine or threonine (or any other amino 
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acid) levels in the leaves (see Table 2 in Example 7) . 
Since AKIII-M4 is insensitive to feedback inhibition by 
any of the end-products of the pathway, this indicates 
that control must be exerted at other steps in the 
5 biosynthetic pathway in leaves. 

In contrast, expression of the AKIII or AKIII-M4 
(with a chloroplast targeting signal) in the seeds 
resulted in 2 to 4-fold or 4 to 23-fold increases, 
respectively, in the level of free threonine in the 

10 seeds compared to untransformed plants and 2 to 3-fold 
increases in the level of free lysine in some cases 
(Table 3, Example 8) . There was a good correlation 
between transformants expressing higher levels of AKIII 
or AKIII-M4 protein and those having higher levels of 

15 free threonine, but this was not the case for lysine. 
The relatively small increases of free threonine or 
lysine achieved with the AKIII protein were not 
sufficient to yield detectable increases compared to 
untransformed plants, in the levels of total threonine 

20 or lysine in the seeds. The larger increases of free 
threonine achieved via expression of the AKIII-M4 
protein were sufficient to yield detectable increases, 
compared to seeds from untransformed plants, in the 
levels of total threonine in the seeds . Sixteen to 

25 twenty-five percent increases in total threonine content 
of the seeds were observed. The lines that showed 
increased total threonine were the same ones the showed 
the highest levels of increase in free threonine and 
high expression of the AKIII-M4 protein. 

30 The above teachings show that amino acid 

biosynthesis takes place in seeds and can be modulated 
by the expression of foreign genes encoding amino acid 
biosynthetic enzymes. Furthermore, they show that 
control of an amino acid biosynthetic pathway can differ 

35 markedly from one plant organ to another, e.g. seeds and 
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leaves. The importance of this observation is 
emphasized upon considering the different effects of 
expressing a foreign DHDPS in leaves and seeds described 
below. It can be concluded that threonine biosynthesis * 
5 in seeds is controlled primarily via end-product 

inhibition of AK. Therefore, threonine accumulation in «■ 
the seeds of plants can be increased by expression of a 
gene, introduced via transformation, that encodes AK 
which is insensitive to lysine inhibition and which is 

10 localized in the chloroplast. 

The above teachings also demonstrate that 
transformed plants which express higher levels of the 
introduced enzyme in seeds accumulate higher levels of 
free threonine in seeds. Furthermore, the teachings 

15 demonstrate that transformed plants which express a 

lysine-insensitive AK in seeds accumulate higher levels 
of free threonine in seeds than do transformed plants 
which express similar levels of a lysine-sensitive AK. 
To achieve commercially valuable increases in free 

20 threonine, a completely lysine-insensitive AK is 
preferred. 

These teachings indicate that the level of free 
lysine in seeds controls the accumulation of another 
aspartate-derived amino acid, threonine, through end- 

25 product inhibition of AK. In order to accumulate high 
levels of free lysine itself, it will be necessary to 
bypass lysine inhibition of AK via expression of a 
lysine-insensitive AK. 

Expression of active £. coli DHDPS enzyme was 

30 achieved in both young and mature leaves of the T 
transformed tobacco plants (Table 4, Example 9} . High 
levels of free lysine, 50 to 100-fold higher than normal f 
tobacco plants, accumulated in the young leaves of the 
plants expressing the enzyme with a chloroplast 

35 targeting signal, but not without such a targeting 
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signal. However, a much smaller accumulation of free 
lysine (2 to 8-fold) was seen in the larger leaves . 
Experiments that measure lysine in the phloem suggest 
that lysine is exported from the large leaves. This 
5 exported lysine may contribute to the accumulation of 
lysine in the small growing leaves, which are known to 
take up, rather than export nutrients. No effect on the 
free lysine levels in the seeds of these plants was 
observed even though £. coli DHDPS enzyme was expressed 

10 in the seeds as well as the leaves. 

High level seed-specific expression of £. coli 
DHDPS enzyme, either with or without a chloroplast 
targeting signal, had no effect on the total, or free, 
lysine or threonine (or any other amino acid) 

15 composition of the seeds in any transformed line (Table 
5, Example 10) . These results demonstrate that 
expression in seeds of a DHDPS enzyme that is less 
sensitive to lysine than the plant enzyme is not 
sufficient to lead to increased production or 

20 accumulation of free lysine. 

These teachings from transf ormants expressing the 
£. coli DHDPS enzyme indicate that lysine biosynthesis 
in leaves is controlled primarily via end-product 
inhibition of DHDPS, while in seeds there must be at 

25 least one additional point of control in the pathway. 

The teachings from transformants expressing the E;. eg li 
AKIII and AKIII-M4 enzymes indicate that the level of 
free lysine in seeds controls the accumulation of all 
aspartate-derived amino acids through end-product 

30 inhibition of AK. AK is therefore an additional control 
point . 

To achieve simultaneous, high level expression of 
both E. DHDPS and AKIII-M4 in leaves and seeds, 

plants that express each of the genes could be crossed 
35 and hybrids that express both could be selected. 
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Another method would be to construct vectors that 
contain both genes on the same DNA fragment and 
introduce the linked genes into plants via 

transformation. This is preferred because the genes * 
5 would remain linked thoughout subsequent plant breeding 

efforts. Representative vectors carrying both genes on * 
the same DNA fragment are described in Examples 11, 12 , 
16 r and 18. 

Tobacco plants transformed with a vector carrying 

10 both E. £di DHDPS and AKIII-M4 genes linked to the 35S 
promoter are described in Example 11. In transformants 
that express little or no AKIII-M4, the level of 
expression of £. coli DHDPS determines the level of 
lysine accumulation in leaves (Example 11, Table 6) . 

15 However, in transformants that express both AKIII-M4 and 
fi. DHDPS, the level of expression of each protein 

plays a role in controlling the level of lysine 
accumulation. Transformed lines that express DHDPS at 
comparable levels accumulate more lysine when AKIII-M4 

20 is also expressed (Table 6, compare lines 564-18A, 564- 
56A, 564-36E, 564-55B, and 564-47A) . Thus, expression 
of a lysine- insensitive AK increases lysine accumulation 
in leaves when expressed in concert with a DHDPS enzyme 
that is 20-fold, less sensitive to lysine than the 

25 endogenous plant enzyme. 

These leaf results, taken together with the seed 
results derived from expressing £- coli AKIII-M4 and 
£. coli DHDPS separately in seeds, suggest that 
simultaneous expression of both £. £&LL AKIII-M4 and 

30 coli DHDPS in seeds would lead to increased ^ 

accumulation of free lysine and would also lead to an 
increased accumulation of free threonine. Tobacco 
plants transformed with a vector carrying both £. coli 
DHDPS and AKIII-M4 genes linked to the phaseolin 

35 promoter are described in Example 12. There is an 
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increased accumulation of free lysine and free threonine 
in these plants. The increased level of free threonine 
was 4-fold over normal seeds, rather than the 20-fold 

* increase seen in seeds expressing AKIII-M4 alone. The 

5 reduction in accumulation of free threonine indicates 

'* that pathway intermediates are being diverted down the 

lysine branch of the biosynthetic pathway. The 
increased level of free lysine was 2-fold over normal 
seeds (or seeds expressing £. coli DHDPS alone) . 
10 However, the lysine increase in seeds is not equivalent 
to the 100-fold increase seen in leaves. 

The £. coli DHDPS enzyme is less sensitive to 
lysine inhibition than plant DHDPS, but is still 
inhibited by lysine. The above teachings on the AK 
15 proteins indicate that expression of a completely 

lysine-insensitive enzyme can lead to a much greater 
accumulation of the aspartate pathway end-product 
threonine than expression of an enzyme which, while less 
sensitive than the plant enzyme, is still inhibited by 
20 lysine. Therefore vectors carrying both Corynebacterium 
DHDPS and AKIII-M4 genes linked to the seed-specific 
promoters were constructed as described in Examples 15 
and 19. Tobacco plants transformed with vectors 
carrying both Corynebacterium DHDPS and AKIII-M4 genes 
25 linked to seed-specific promoters are described in 

Example 15. As shown in Table 9, these plants did not 
show a greater accumulation of free lysine in seeds than 
previously described plants expressing the £. coli DHDPS 
enzyme in concert with the lysine-insensitive AK. In 

i 30 hindsight this result can be explained by the fact that 

lysine accumulation in seeds never reached a level high 

, enough to inhibit the £. coli DHDPS, so replacement of 

this enzyme with lysine-insensitive Corynebacterium 
DHDPS had no effect. 



WO 93/19190 



36 



PCT/US93/02480 



In transformed lines expressing high levels of 
E. £QlL AKIII-M4 and E. coli DHDPS or Corynebartierium 
DHDPS . it was possible to detect substantial amounts of 
a-aminoadipic acid. This compound is thought to be an 

5 intermediate in the catabolism of lysine in cereal 
seeds, but is normally detected only via radioactive 
tracer experiments due to its low level of accumulation. 
The discovery of high levels of this intermediate, 
comparable to levels of free amino acids, indicates that 

10 a large amount of lysine is being produced in the seeds 
of these transformed lines and is entering the catabolic 
pathway. The build-up of a-aminoadipic acid was not 
observed in transformants expressing only E. coli DHDPS 
or only AKIII-M4 in seeds . These results show that it 

15 is necessary to express both enzymes simultaneously to 
produce high levels of free lysine in seeds. To 
accumulate high levels of free lysine it may also be 
necessary to prevent lysine catabolism. Alternatively, 
it may be desirable to convert the high levels of lysine 

20 produced into a form that is insensitive to breakdown, 
e.g. by incorporating it into a di-, tri- or 
oligopeptide, or a lysine-rich storage protein. 

Isolation of ft Plant 

Lysine Ketoglutarate Reductase Gene 
25 There is little information yet available on lysine 

catabolism. in plants. Available evidence indicates that 
lysine is catabolized via the saccharopine pathway. The 
first enzymatic evidence for the existence of this 
pathway was the detection of lysine ketoglutarate 
30 reductase (LKR) activity in immature endosperm of 
developing maize seeds [Arruda et al, (1982) Plant 
Physiol. 69:988-989]. LKR catalyzes the first step in 
lysine catabolism, the condensation of L-lysine with 
cc-ketoglutarate into saccharopine using NADPH as a 

35 cof actor. LKR activity increases sharply from the onset 
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of endosperm development/ reaches a peak level at about 
20 days after pollination, and then declines [Arruda et 
al. (1983) Phytochemistry 22:2687-2689] . In order to 
prevent the catabolism of lysine it would be desirable 
5 to reduce or eliminate LKR expression or activity. This 
could be accomplished by cloning the LKR gene, preparing 
a chimeric gene to express antisense RNA (Eur. Patent 
Applic. No. 8*3112647.7) for LKR, and introducing the 
chimeric gene into plants via transformation. 

10 In order to clone the corn LKR* gene, RNA was 

isolated from developing seeds 19 days after 
pollination. This RNA was sent to Clontech 
Laboratories, Inc., (Palo Alto, CA) for the custom 
synthesis of a cDNA library in the vector Lambda Zap II. 

15 The conversion of the Lambda Zap II library into a 

phagemid library, and then into a plasmid library was 
accomplished following the protocol provided by 
Clontech. Once converted into a plasmid library the 
ampicillin-resistant clones obtained carry the cDNA 

20 insert in the vector pBluescript SK(-) . Expression of 
the cDNA is under control of the lacZ promoter on the 
vector. 

To select clones that carried the LKR gene, a 
specially designed E. coli host, DE126, was constructed 

25 as described in Example 20. DE126 has the genotype 
F'16/malE52: :Tnl0, arg", ilvA296, thrAHOl, metLlOO, 
lys£ + , X~, rpsL9 r maiTl, xyl-7, mtl-2, thi-1?, supE44?. 
Its growth is inhibited by 20 jlg/mL of L-lysine in a 
synthetic medium. Expression of LKR in DE126 is 

30 expected to reverse the growth inhibition by reducing 

the lysine concentration. Use of DE126 to select clones 
from the corn cDNA library that lead to lysine-resistant 
growth is described in Example 20. 

The corn LKR cDNA obtained in this way can be used 

35 as a DNA hybridization probe to identify and isolate LKR 
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genes from other plant species . A chimeric gene 
designed to express antisense RNA for LKR can be 
constructed by linking the LKR gene in reverse 

orientation to any of the plant promoter sequences * 
5 described above. Preferred promoters would be seed- 
specific promoters . For corn f a strong endosperm- * 
specific promoter, e.g., the 10 kD or 27 kD zein 
promoter, would be preferred. 

In order to obtain plants that express a chimeric 
10 gene for antisense LKR, as well as genes encoding 

lysine- insensitive AK and DHDPS, the antisense LKR gene 
could be linked to genes encoding lysine- insensitive AK 
and DHDPS and all three genes could be introduced into 
plants via transformation. Alternatively, the chimeric 
15 gene for antisense LKR could be introduced into 

previously transformed plants that express lysine- 
insensitive AK and DHDPS, or the antisense LKR gene 
could be introduced into normal plants and the 
transformants obtained could be crossed with plants that 
20 express lysine- insensitive AK and DHDPS. 

Pse of the cts/lysC-M4 Chimeric Gene as a 
Selectable Marker for Plant Transformation 

Growth of cell cultures and seedlings of many 
plants is inhibited by high concentrations of lysine 

25 plus threonine. Growth is restored by addition of 
methionine (or homoserine which is converted to 
methionine in vivo ) . Lysine plus threonine inhibition 
is thought to result from feedback inhibition of 
endogenous AK, which reduces flux through the pathway 

30 leading to starvation for methionine. In tobacco there f 
are two AK enzymes in leaves, one lysine-sensitive and 
one threonine sensitive. [Negrutui et al. (1984) Theor. 
Appl. Genet. 68:11-20] . High concentrations of lysine 
plus threonine inhibit growth of shoots from tobacco 

35 leaf disks and inhibition is reversed by addition of low 
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concentrations of methionine. Thus, growth inhibition 
is presumably due to inhibition of the two AK isozymes . 

Expression of active lysine and threonine 
insensitive AKIII-M4 also reverses lysine plus threonine 
growth inhibition (Table 2, Example 7) . There is a good 
correlation between the level of AKIII-M4 protein 
expressed and the resistance to lysine plus threonine. 
Expression of lysine-sensitive wild type AKIII does not 
have a similar effect. Since expression of the AKIII-M4 
protein permits growth under normally inhibitory 
conditions, a chimeric gene that causes expression of 
AKIII-M4 in plants can be used as a selectable genetic 
marker for transformation as illustrated in Examples 13 
and 17. 

EXAMPLES 

The present invention is further defined in the 
following Examples, in which all parts and percentages 
are by weight and degrees are Celsius, unless otherwise 
stated. It should be understood that these Examples, 
while indicating preferred embodiments of the invention, 
are given by way of illustration only. From the above 
discussion and these Examples, one skilled in the art 
can ascertain the essential characteristics of this 
invention, and without departing from the spirit and 
scope thereof, can make various changes and 
modifications of the invention to adapt it to various 
usages and conditions. 

EXAMPLE 1 

Isolation of the E . coli lvsC Gene and mutations 
in lvsC resulting in Ivsine-insensitive AKTTT 
The £. qqIx lysC gene has been cloned, restriction 
endonuclease mapped and sequenced previously [Cassan 
et al. (1986) J. Biol. Chem. 261:1052-1057], For the 
present invention the lysC gene was obtained on a 
bacteriophage lambda clone from an ordered library of 
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3400 overlapping segments of cloned £. DNA 
constructed by Kohara, Akiyama and Isono [Kohara et al. 
(1987) Cell 50:595-508]. This library provides a 

physical map of the whole £. coli chromosome and ties * 
5 the physical map to the genetic map. From the knowledge 

of the map position of lysC at 90 min. on the £. qqIL * 
genetic map [Theze et al. (1974) J. Bacteriol. 
117:133-143], the restriction endonuclease map of the 
cloned gene [Cassan et al. (1986) J. Biol. Chem. 

10 261:1052-1057], and the restriction endonuclease map of 
the cloned DNA fragments in the £. coli library [Kohara 
et al. (1987) Cell 50:595-508], it was possible to 
choose lambda phages 4E5 and 7A4 [Kohara et al. (1987) 
Cell 50:595-508] as likely candidates for carrying the 

15 lysC gene. The phages were grown in liquid culture from 
single plaques as described [see Current Protocols in 
Molecular Biology (1987) Ausubel et al. eds. John Wiley 
& Sons New York] using LE392 as host [see Sambrook 
et al. (1989) Molecular Cloning: a Laboratory Manual, 

20 Cold Spring Harbor Laboratory Press] . Phage DNA was 

prepared by phenol extraction as described [see Current 
Protocols in Molecular Biology (1987) Ausubel et al, 
eds. John Wiley & Sons New York] . 

From the sequence of the gene several restriction 

25 endonuclease fragments diagnostic for the lysC gene were 
predicted, including an 1860 bp EcoR I-Nhe I fragment, a 
2140 bp EcoR I-Xmn I fragment and a 1600 bp 
EcoR I-BamH I fragment. Each of these fragments was 
detected in both of the phage DNAs confirming that these 

30 carried the lysC gene. The EcoR I-Nhe I fragment was 
isolated and subcloned in plasmid pBR322 digested with 
the same enzymes, yielding an ampicillin-resistant, 
tetracycline-sensitive coli transf ormant . The 
plasmid was designated pBT436. 
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To establish that the cloned lysC gene was 
functional, pBT436 was transformed into fi. coli strain 
Gifl06Ml (£, qq1± Genetic Stock Center strain CGSC-5074) 
which has mutations in each of the three £. coli AK 
5 genes [Theze et al. (1974) J. Bacterid. 117:133-143], 
This strain lacks all AK activity and therefore requires 
diaminopimelate (a precursor to lysine which is also 
essential for cell wall biosynthesis), threonine and 
methionine. In the transformed strain all these 

10 nutritional requirements were relieved demonstrating 
that the cloned lysC gene encoded functional AKIII. 

Addition of lysine (or diaminopimelate which is 
readily converted to lysine in vivo) at a concentration 
of approximately 0.2mM to the growth medium inhibits the 

15 growth of Gifl06Ml transformed with pBT436. M9 media 
[see Sambrook et al. (1989) Molecular Cloning: a 
Laboratory Manual, Cold Spring Harbor Laboratory Press] 
supplemented with the arginine and isoleucine, required 
for Gifl06Ml growth, and ampicillin, to maintain 

20 selection for the pBT436 plasmid, was used. This 

inhibition is reversed by addition of threonine plus 
methionine to the growth media. These results indicated 
that AKIII could be inhibited by exogenously added 
lysine leading to starvation for the other amino acids 

25 derived from aspartate. This property of pBT436- 

transformed Gifl06Ml was used to select for mutations in 
lysC that encoded lysine-insensitive AKIII. 

Single colonies of Gifl06Ml transformed with pBT-436 
were picked and resuspended in 200^lL of a mixture of 

30 100M-L 1% lysine plus lOO^iL of M9 media. The entire cell 
suspension containing 10 7 -10 8 cells was spread on a 
petri dish containing M9 media supplemented with the 
arginine, isoleucine, and ampicillin. Sixteen petri 
dishes were thus prepared. From 1 to 20 colonies 

35 appeared on 11 of the 16 petri dishes. One or two (if 
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available) colonies were picked and retested for lysine 
resistance and from this nine lysine-resistant clones 
were obtained. Plasmid DNA was prepared from eight of 
these and re-transformed into Gifl06Ml to determine 
5 whether the lysine resistance determinant was plasmid- 
borne. Six of the eight plasmid DNAs yielded lysine- 
resistant colonies. Three of these six carried lysC 
genes encoding AKIII that was uninhibited by 15mM 
lysine, whereas wild type AKIII is 50% inhibited by 

10 0.3-0.4mM lysine and >90% inhibited by ImM lysine (see 
Example 2 for details) . 

To determine the molecular basis for ly sine- 
resistance the sequences of the wild type lysC gene and 
three mutant genes were determined. A method for "Using 

15 mini-prep plasmid DNA for sequencing double stranded 
templates with sequenase™" [Kraft et al. (1988) 
BioTechniques 6:544-545] was used. Oligonucleotide 
primers , based on the published lysC sequence and spaced 
approximately every 200 bp, were synthesized to 

20 facilitate the sequencing. The sequence of the wild 
type lysC gene cloned in pBT436 (SEQ ID NO:l) differed 
from the published lysC sequence in the coding region at 
5 positions. Four of these nucleotide differences were 
at the third position in a codon and would not result in 

25 a change in the amino acid sequence of the AKIII 
protein. One of the differences would result in a 
cysteine to glycine substitution at amino acid 58 of 
AKIII. These differences are probably due to the 
different strains from which the lysC genes were cloned. 

30 The sequences of the three mutant lysC genes that 

encoded lysine-insensitive AK each differed from the 
wild type sequence by a single nucleotide, resulting in 
a single amino acid substitution in the protein. Mutant 
M2 had an A substituted for a G at nucleotide 954 of 

35 SEQ ID NO:l resulting in an isoleucine for methionine 



WO 93/19190 



PCT/US93/02480 



43 



substitution at amino acid 318 and mutants M3 and M4 had 
identical T for C substitutions at nucleotide 1055 of 
SEQ ID NO:l resulting in an isoleucine for threonine 
substitution at amino acid 352. Thus, either of these 
5 single amino acid substitutions is sufficient to render 
the AKIII enzyme insensitive to lysine inhibition. 

EXAMPLE 2 

High level expression of wild type and 
mutant lysC genes in E > coJLi 
10 An Nco I (CCATGG) site was inserted at the 

translation initiation codon of the lysC gene using the 
following oligonucleotides: 

SEQ ID NO: 2: 
15 GATCCATGGC TGAAATTGTT GTCTCCAAAT TTGGCG 



SEQ ID NO: 3: 

GTACCGCCAA ATTTGGAGAC AACAATTTCA GCCATG 



20 When annealled these oligonucleotides have BamH I and 
Asp718 "sticky" ends. The plasmid pBT436 was digested 
with BamH I, which cuts upstream of the lysC coding 
sequence and Asp718 which cuts 31 nucleotides downstream 
of the initiation codon. The annealled oligonucleotides 

25 were ligated to the plasmid vector and £. coli 

transformants were obtained. Plasmid DNA was prepared 
and screened for insertion of the oligonucleotides based 
on the presence of an Nco I site. A plasmid containing 
the site was sequenced to assure that the insertion was 

30 correct, and was designated pBT457. In addition to 

creating an Nco I site at the initiation codon of lysC r 
this oligonucleotide insertion changed the second codon 
from TCT, coding for serine, to GCT, coding for alanine. 
This amino acid substitution has no apparent effect on 

35 the AKIII enzyme activity. 
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To achieve high level expression of the lysC genes 
in £. coli r the bacterial expression vector pBT430 was 
used. This vector is a derivative of pET-3a [Rosenberg 
et al. (1987) Gene 56:125-135] which employs the 
5 bacteriophage T7 RNA polymerase/T7 promoter system. 
Plasmid pBT430 was constructed by first destroying the 
EcoR I and Hind III sites in pET-3a at their original 
positions. An oligonucleotide adaptor containing EcoR I 
and Hind III sites was inserted at the BamH I site of 

10 pET-3a. This created pET-3aM with additional unique 

cloning sites for insertion of genes into the expression 
vector. Then, the Nde I site at the position of 
translation initiation was converted to an Nco I site 
using oligonucleotide-directed mutagenesis . The DNA 

15 sequence of pET-3aM in this region, 5'- CATATG G, was 
converted to 5 ' - CCCATGG in pBT430. 

The lysC gene was cut out of plasmid pBT457 as a 
1560 bp Nco I-EcoR I fragment and inserted into the 
expression vector pBT430 digested with the same enzymes, 

20 yielding plasmid pBT461. For expression of the mutant 
lysC genes (M2, M3 and M4) pBT461 was digested with 
Kpn I-EcoR I, which removes the wild type lysC gene from 
about 30 nucleotides downstream from the translation 
start codon, and inserting the analogous Kpn I-EcoR I 

25 fragments from the mutant genes yielding plasmids 
pBT490, pBT491 and pBT492, respectively. 

For high level expression each of the plasmids was 
transformed into £. solx strain BL21(DE3) [Studier 
et al. (1986) J. Mol. Biol. 189:113-130]. Cultures were 

30 grown in LB medium containing ampicillin (100 rag/L) at 
25°C. At an optical density at 600 nm of approximately 
1, IPTG (isopropylthio-p-galactoside, the inducer) was 

added to a final concentration of 0,4 mM and incubation 
was continued for 3 h at 25°C. The cells were collected 
35 by centrifugation and resuspended in l/20th (or l/100th) 
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the original culture volume in 50 mM NaCl; 50 mM 

Tris-Cl, pH 7.5; 1 mM EDTA, and frozen at -20°C. Frozen 

aliquots of 1 mL were thawed at 37°C and sonicated/ in 

an ice-water bath, to lyse the cells. The lysate was 
5 centrifuged at 4°C for 5 min at 15,000 rpm. The 

supernatant was removed and the pellet was resuspended 

in 1 mL of the above buffer. 

The supernatant and pellet fractions of uninduced 

and IPTG-induced cultures of BL21 (DE3) /pBT4 61 were 
10 analyzed by SDS polyacrylamide gel electrophoresis. The 

major protein visible by Coomassie blue staining in the 

supernatant of the induced culture had a molecular 

weight of about 48 kd, the expected size for AKIII. 

About 80% of the AKIII protein was in the supernatant 
15 and AKIII represented 10-20% of the total £. coli 

protein in the extract. 

AK activity was assayed as shown below: 

Assay mix (for 12 assay tubes) : 

4.5 mL H 2 0 
20 1.0 mL 8M KOH 

1.0 mL 8M NH 2 OH-HCl 

1.0 mL 1M Tris-HCl pH 8.0 

0.5 mL 0.2M ATP (121 mg/mL in 0.2M NaOH) 

50 JXL 1M MgS04 

25 

Each 1.5 mL eppendorf assay tube contained: 
0.64 mL assay mix 

0.04 mL 0.2M L-aspartic acid or 0.04 mL H2O 
0.0005-0.12 mL extract 
30 H2O to total volume 0.8 mL 

Assay tubes were incubated at 30°C for desired time 
(10-60 min) . Then 0.4-mL FeCl3 reagent (10% w/v FeCl3, 
3.3% trichloroacetic acid, 0.7M HC1) was added and the 
35 material centrifuged for 2 min in an eppendorf 
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centrifuge. The supernatant was decanted. The OD was 
read at 540nm and compared to the aspartyl-hydroxamate 
standard . 

Approximately 80% of the AKIII activity was in the 
5 supernatant fraction. The specific activity of wild 
type and mutant crude extracts was 5-7 pinoles product 
per min per milligram total protein. Wild type AKIII 
was sensitive to the presence of L-lysine in the assay. 
Fifty percent inhibition was found at a concentration of 

10 about 0.4mM and 90 percent inhibition at about l.OmM. 

In contrast, mutants AKIII-M2, M3 and M4 (see Example 1) 
were not inhibited at all by 15mM L- lysine. 

Wild type AKIII protein was purified from the 
supernatant of the IPTG- induced culture as follows. To 

15 1 mL of extract, 0.25 mL of 10% streptomycin sulfate was 
added and kept at 4°C overnight. The mixture was 
centrifuged at 4°C for 15 min at 15 , 000 rpm. The 
supernatant was collected and desalted using a Sephadex 
G-25 M column (Column PD-10, Pharmacia) . It was then 

20 run on a Mono-Q HPLC column and eluted with a 0-1M NaCl 
gradient. The two 1 mL fractions containing most of the 
AKIII activity were pooled, concentrated, desalted and 
run on an HPLC sizing column (TSK G3000SW) . Fractions 
were eluted in 20mM KPO4 buffer, pH7.2, 2mM MgSCU, lOmM 

25 P-mercaptoethanol, 0,15M KC1, 0.5mM L-lysine and were 
found to be >95% pure by SDS polyacrylamide gel 
electrophoresis. Purified AKIII protein was sent to 
Hazelton Research Facility (310 Swampridge Road, Denver, 
PA 17517) to have rabbit antibodies raised against the 

30 protein. 

EXAMPLE 3 

Isolation of the E. coli and Corynebacterium 

glutamic™ dapA genes 
The E. coli dapA gene (ecodapA) has been cloned, 
35 restriction endonuclease mapped and sequenced previously 
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[Richaud et al. (1986) J. Bacterid. 166:297-300]. For 
the present invention the dapA gene was obtained on a 
bacteriophage lambda clone from an ordered library of 
3400 overlapping segments of cloned £. coli DNA 
constructed by Kohara, Akiyama and Isono [Kohara et al. 
(1987) Cell 50:595-508, see Example 1]. From the 
knowledge of the map position of dapA at 53 min on the 
£. coli genetic map [Bachman (1983) Microbiol. Rev. 
47:180-230], the restriction endonuclease map of the 
cloned gene [Richaud et al. (1986) J. Bacteriol. 
166:297-300], and the restriction endonuclease map of 
the cloned DNA fragments in the £. coli library [Kohara 
et al. (1987) Cell 50:595-508], it was possible to 
choose lambda phages 4C11 and 5A8 [Kohara et al. (1987) 
Cell 50:595-508] as likely candidates for carrying the 
dae& gene. The phages were grown in liquid culture from 
single plaques as described [see Current Protocols in 
Molecular Biology (1987) Ausubel et al. eds., John Wiley 
& Sons New York] using LE392 as host [see Sambrook 
et al. (1989) Molecular Cloning: a Laboratory Manual, 
Cold Spring Harbor Laboratory Press]. Phage DNA was 
prepared by phenol extraction as described [see Current 
Protocols in Molecular Biology (1987) Ausubel et al. 
eds., John Wiley & Sons New York]. Both phages 
contained an approximately 2.8 kb Pst I DNA fragment 
expected for the daeA gene [Richaud et al. (1986) J. 
Bacteriol. 166:297-300]. The fragment was isolated from 
the digest of phage 5A8 and inserted into Pst I digested 
vector pBR322 yielding plasmid pBT427. 

The Corynebacterium dapA gene (cordaeA) was 
isolated from genomic DNA from ATCC strain 13032 using 
polymerase chain reaction (PCR) . The nucleotide 
sequence of the Corynebacterivm dapA gene has been 
published [Bonnassie et al. (1990) Nucleic Acids Res. 
18:6421]. From the sequence it was possible to design 
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oligonucleotide primers for PCR that would allow 
amplification of a DNA fragment containing the gene, and 
at the same time add unique restriction endonuclease 
sites at the start codon (Nco I) and just past the stop 
5 codon (EcoR I) of the gene. The oligonucleotide primers 
used were: 

SEQ ID NO: 4: 

CCCGGGCCAT GGCTACAGGT TTAACAGCTA AGACCGGAGT AGAGCACT 

10 

SEQ ID NO: 5: 

GATATCGAAT TCTCATTATA GAACTCCAGC TTTTTTC 

PCR was performed using a Perkin-Elmer Cetus kit 

15 according to the instructions of the vendor on a 

thermocycler manufactured by the same company. The 
reaction product, when run on an agarose gel and stained 
with ethidium bromide, showed a strong DNA band of the 
size expected for the Corynebacterium dapA gene, about 

20 900 bp. The PCR-generated fragment was digested with 
restriction endonucleases Nco I and EcoR I and inserted 
into expression vector pBT430 (see Example 2) digested 
with the same enzymes. In addition to introducing an 
Nco I site at the translation start codon, the PCR 

25 primers also resulted in a change of the second codon 
from AGC coding for serine to GCT coding for alanine. 
Several clones that expressed active, lysine-insensitive 
DHDPS (see Example 4) were isolated, indicating that the 
second codon amino acid substitution did not affect 

30 activity; one clone was designated FS766. 

The Nco I to EcoR I fragment carrying the 
PCR-generated Co ryneba cter ium dapA gene was subcloned 
into the phagemid vector pGEM-9Zf (-) from Promega, 
single-stranded DNA was prepared and sequenced. This 

35 sequence is shown in SEQ ID NO: 6. 
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Aside from the differences in the second codon 
already mentioned, the sequence matched the published 
sequence except at two positions, nucleotides 798 and 
799. In the published sequence these are TC, while in 
5 the gene shown in SEQ ID NO: 6 they are CT. This change 
results in an amino acid substitution of leucine for 
serine. The reason for this difference is not known. 
It may be due to an error in the published sequence, the 
difference in strains used to isolate the gene, or a 

10 PCR-generated error. The latter seems unlikely since 

the same change was observed in at least 3 independently 
isolated PCR-generated dapA genes. The difference has 
no apparent effect on DHDPS enzyme activity (see 
Example 4) . 

15 EXAMPLS 4 

High level expression of the E. eoli anri 
Corvnebacterium alutamicum dapA o^ nes in E. eoli 
An Nco I (CCATGG) site was inserted at the 
translation initiation codon of the £. eoli dapA gene 

20 using oligonucleotide-directed mutagenesis. The 2.8 kb 
Pst I DNA fragment carrying the dapA gene in plasmid 
pBT427 (see Example 3) was inserted into the Pst I site 
of phagemid vector pTZ18R (Pharmacia) yielding pBT431. 
The orientation of the dapA gene was such that the 

25 coding strand would be present on the single-stranded 
phagemid DNA. Oligonucleotide-directed mutagenesis was 
carried out using a Muta-Gene kit from Bio-Rad according 
to the manufacturer's protocol with the mutagenic primer 
shown below: 

30 

SEQ ID NO: 7: 

CTTCCCGTGA CCATGGGCCA TC 

Putative mutants were screened for the presence of an 
35 Nco I site and a plasmid, designated pBT437, was shown 
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to have the the proper sequence in the vicinity of the 
mutation by DNA sequencing. The addition of an Nco I 
site at the translation start codon also resulted in a 
change of the second codon from TTC coding for 
5 phenylalanine to GTC coding for valine. 

To achieve high level expression of the dapA genes 
in £. coli the bacterial expression vector pBT430 (see 
Example 2) was used. The £. coli dapA gene was cut out 
of plasmid pBT437 as an 1150 bp Nco I-Hind III fragment 

10 and inserted into the expression vector pBT430 digested 
with the same enzymes, yielding plasmid pBT442. For 
expression of the Co r y n eba ct e r ium dapA gene, the 910 bp 
Nco I to EcoR I fragment of SEQ ID NO: 6 inserted in 
pBT430 (pFS766, see Example 3) was used. 

15 For high level expression each of the plasmids was 

transformed into £. coli strain BL21(DE3) [Studier 
et al. (1986) J. Mol. Biol. 189:113-130] . Cultures were 
grown in LB medium containing ampicillin (100 mg/L) at 
25°C. At an optical density at 600 nm of approximately 

20 1, IPTG (isopropylthio-p-galactoside, the inducer) was 

added to a final concentration of 0.4 roM and incubation 
was continued for 3 h at 25°C. The cells were collected 
by centrifugation and resuspended in l/20th (or l/100th) 
the original culture volume in 50 mM NaCl; 50 mM 

25 Tris-Cl, pH 7.5; 1 mM EDTA, and frozen at -20°C. Frozen 
aliquots of 1 mL were thawed at 37°C and sonicated, in 
an ice-water bath, to lyse the cells. The lysate was 
centrifuged at 4°C for 5 min at 15,000 rpm. The 
supernatant was removed and the pellet was resuspended 

30 in 1 mL of the above buffer. 

The supernatant and pellet fractions of uninduced 
and IPTG- induced cultures of BL21 (DE3) /pBT442 or 
BL21 (DE3) /pFS7 66 were analyzed by SDS polyacrylamide gel 
electrophoresis. The major protein visible by Coomassie 

35 blue staining in the supernatant and pellet fractions of 
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both induced cultures had a molecular weight of 32-34 
kd, the expected size for DHDPS. Even in the uninduced 
cultures this protein was the most prominent protein 
produced. 

In the BL21(DE3)/pBT442 IPTG-induced culture about 
80% of the DHDPS protein was in the supernatant and 
DHDPS represented 10-20% of the total protein in the 
extract. In the BL21 (DE3) /pFS766 IPTG-induced culture 
more than 50% of the DHDPS protein was in the pellet 
fraction. The pellet fractions in both cases were 
90-95% pure DHDPS, with no other single protein present 
in significant amounts. Thus, these fractions were pure 
enough for use in the generation of antibodies. The 
pellet fractions containing 2-4 milligrams of either 
E. QQlL DHDPS or Corynebacterium DHDPS were solubilized 
in 50 mM NaCl; 50 mM Tris-Cl, pH 7.5; 1 mM EDTA, 0.2mM 
dithiothreitol, 0.2% SDS and sent to Hazelton Research 
Facility (310 Swampridge Road, Denver, PA 17517) to have 
rabbit antibodies raised against the proteins. 

DHDPS enzyme activity was assayed as follows: 
Assay mix (for 10 X 1.0 mL assay tubes or 40 X 0.25 mL 
for microtiter dish); made fresh, just before use: 

2.5mL H2O 

0 . 5mL 1 . 0M Tris-HCl pH8 . 0 

0.5mL 0.1M Na Pyruvate 

O.SmL o-Aminobenzaldehyde (lOmg/mL in ethanol) 

25UL 1.0M DL-Aspartic-p-semialdehyde (ASA) in 1.0N 



HCl 



Assay (l.OmL): MicroAssay (0.25mL) : 



DHDPS assay mix 
enzyme extract + H2O; 
lOxnM L-lysine 



5|iL or 20HL 



0.40mL 



O.lOmL 



O.lOmL 



.025mL 



1J1L or 5|IL 



35 
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Incubate at 30°C for desired time. Stop by addition of: 
1.0N HC1 0.50mL 0.125mL 

Color allowed to develop for 30-60 min. Precipitate 
5 spun down in eppendorf centrifuge. OD540 vs 0 min read 
as blank. For MicroAssay, aliquot 0.2mL into microtiter 
well and read at OD530. 

The specific activity of £. coli DHDPS in the 
supernatant fraction of induced extracts was about 

10 50 OD540 units per minute per milligram protein in a 
1.0 mil assay. £. coli DHDPS was sensitive to the 
presence of L-lysine in the assay. Fifty percent 
inhibition was found at a concentration of about 0.5mM. 
For Corynebacterium DHDPS, the activity was measured in 

15 the supernatant fraction of uninduced extracts, rather 

than induced extracts . Enzyme activity was about 4 OD530 
units per minute per milligram protein in a 0.25 mL 
assay. In contrast to £. QqUl DHDPS, Corynebacterium 
DHDPS was not inhibited at all by L-lysine, even at a 

20 concentration of 70mM. 

EXAMPLE 5 

Excretion of amino acids bv E. coli expressing 

high levels of DHDPS and/or AKIII 
The £. coli expression cassette with the £. £&1A 
25 dapA gene linked to the T7 RNA polymerase promoter was 
isolated by digesting pBT442 (see Example 4) with Bgl II 
and BamH I separating the digestion products via agarose 
gel electrophoresis and eluting the approximately 
1250 bp fragment from the gel. This fragment was 
30 inserted into the BamH I site of plasmids pBT461 
(containing the T7 promoter/l^sfL gene) and pBT492 
(containing the T7 promoter/lys£-M4 gene) . Inserts 
where transcription of both genes would be in the same 
direction were identified by restriction endonuclease 
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analysis yielding plasmids pBT517 (T7/daE& + T7/i^s£-M4) 
and pBT519 (T7/ dapA + T7/ lysC ) . 

In order to induce E.. coli to produce and excrete 
amino acids, these plasmids, as well as plasmids pBT442, 
5 pBT4 61 and pBT4 92 (and pBR322 as a control) were 
transformed into £. coli strain BL21 (DE3) [Studier 
et al. (1986) J. Mol. Biol. 189:113-130]. All of these 
plasmids, but especially pBT517 and pBT519, are somewhat 
unstable in this host strain, necessitating careful 

10 maintenance of selection for ampicillin resistance 
during growth. 

All strains were grown in minimal salts M9 media 
[see Sambrook et al. (1989) Molecular Cloning: a 
Laboratory Manual, Cold Spring Harbor Laboratory Press] 

15 supplemented with ampicillin to maintain selection for 
the plasmids overnight at 37°C. Cultures were collected 
when they reached an OD600 of 1. Cells were removed by 
centrifugation and the supernatants (3 mL) were passed 
through 0.2 micron filters to remove remaining cells and 

20 large molecules. Five microliter aliquots of the 
supernatant fractions were analyzed for amino acid 
composition with a Beckman Model 6300 amino acid 
analyzer using post-column ninhydrin detection. Results 
are shown in Table 1. 



TABLE ] 



Amino Acid 


Concentration 


in Culture 


Supernatants 


CmM] 


Plasmid 


Lys 


Thr 


Met 


Ala 




Asp 


filu 


pBR322 


0 


0 


0 


0.05 


0.1 


0 


0 


PBT442 


0.48 


0 


0 


0.04 


0.06 


0 


0 


PBT461 


0.14 


0.05 


0 


0.02 


0.03 


0 


0 


pBT492 


0.16 


0.07 


0 


0.02 


0.03 


0 


0 


PBT517 


0.18 


0 


0.01 


0 


0 


0.02 


0.02 


pBT519 


0.14 


0 


0.01 


0 


0 


0.01 


0 
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All of the plasmids, except the pBR322 control, 
lead to the excretion of lysine into the culture medium. 
Expression of the lysC or the l£s£-M4 gene lead to both 
lysine and threonine excretion. Expression of lys£-M4 + 
5 dapA lead to excretion of lysine, methionine, aspartic 
acid and glutamic acid, but not threonine. In addition, 
alanine and valine were not detected in the culture 
supernatant. Similar results were obtained with lysC + 
dapA , except that no glutamic acid was excreted. 

10 EXAMPLE 6 

Construct ion of Chimeric daoA, IvsC and lvsC-M4 

Genes fo r Expression in Plants 
Several gene expression cassettes were used for 
construction of chimeric genes for expression of 

15 ecodapA, cordapA, lysC and lys£L-M4 in plants. A leaf 
expression cassette (Figure la) is composed of the 35S 
promoter of cauliflower mosaic virus [Odell et al. (1985) 
Nature 313:810-812; Hull et al. (1987) Virology 
86:482-493], the translation leader from the chlorophyll 

20 a/b binding protein (Cab) gene, [Dunsmuir (1985) Nucleic 
Acids Res, 13:2503-2518] and 3* transcription 
termination region from the nopaline synthase (Nos) gene 
[Depicker et al. (1982) J. Mol. Appl. Genet. 1:561-570]. 
Between the 5' and 3 1 regions are the restriction 

25 endonuclease sites Nco I (which includes the ATG 

translation initiation codon) , EcoR I, Sma I and Kpn I. 
The entire cassette is flanked by Sal I sites; there is 
also a BamH I site upstream of the cassette. 

A seed-specific expression cassette (Figure lb) is 

30 composed of the promoter and transcription terminator 

from the gene encoding the p subunit of the seed storage 
protein phaseolin from the bean Phaseolus vulgaris 
[Doyle et al. (1986) J. Biol. Chem. 261:9228-9238]. The 
phaseolin cassette includes about 500 nucleotides 

35 upstream (5 T ) from the translation initiation codon and 
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about 1650 nucleotides downstream (3 1 ) from the 
translation stop codon of phaseolin. Between the 5' and 
3 1 regions are the unique restriction endonuclease sites 
Nco I (which includes the ATG translation initiation 
5 codon), Sma I, Kpn I and Xba I. The entire cassette is 
flanked by Hind III sites. 

A second seed expression cassette was used for the 
cordspA gene. This was composed of the promoter and 
transcription terminator from the soybean Kunitz trysin 

10 inhibitor 3 (KTI3) gene [Jofuku et al. (1989) Plant Cell 
1:427-435]. The KTI3 cassette includes about 2000 
nucleotides upstream (5') from the translation 
initiation codon and about 240 nucleotides downstream 
(3') from the translation stop codon of phaseolin. 

15 Between the 5* and 3' regions are the unique restriction 
endonuclease sites Nco I (which includes the ATG 
translation initiation codon), Xba I, Kpn I and Sma I. 
The entire cassette is flanked by BamH I sites. 

A constitutive expression cassette for corn was 

20 used for expression of the lysC -M4 gene and the ecodaE& 
gene. It was composed of a chimeric promoter derived 
from pieces of two corn promoters and modified by in 
vitro site-specific mutagenesis to yield a high level 
constitutive promoter and a 3 1 region from a corn gene 

25 of unknown function. Between the 5' and 3 1 regions are 
the unique restriction endonuclease sites Nco I (which 
includes the ATG translation initiation codon) , Sma I 
and Bgl II. The nucleotide sequence of the constitutive 
corn expression cassette is shown in SEQ ID NO: 16. 

30 Plant amino acid biosynthetic enzymes are known to 

be localized in the chloroplasts and therefore are 
synthesized with a chloroplast targeting signal. 
Bacterial proteins such as DHDPS and AKIII have no such 
signal. A chloroplast transit sequence (cts) was 

35 therefore fused to the eco dapA , co rdapA , lysC P and 
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lysC- M4 coding sequence in some chimeric genes. The cts 
used was based on the the cts of the small subunit of 
ribulose 1, 5-bisphosphate carboxylase from soybean 
[Berry-Lowe et al. (1982) J. Mol. Appl. Genet. 
5 1:483-498], The oligonucleotides SEQ ID N0S:8-11 were 
synthesized and used as described below. For corn the 
cts used was based on the the cts of the small subunit 
of ribulose 1,5-bisphosphate carboxylase from corn 
[Lebrun et al. (1987) Nucleic Acids Res. 15:4360] and is 
10 designated mcts to distinguish it from the soybean cts. 
The oligonucleotides SEQ ID NOS: 17-22 were synthesized 
and used as described below* 

Fourteen chimeric genes were created: 

No. 1) 35S promoter/Cab leader/lys£/Nos 3' 
15 No. 2) 35S promoter/Cab leader/cts/lys£/Nos 3 f 

No. 3) 35S promoter/Cab leader /cts/l£££-M4/Nos 
3 f 

No. 4) phaseolin 5 f region/cts/l£s£/phaseolin 3 f 
region 

20 No. 5) phaseolin 5 f region /cts/l^a£-M4 /phaseolin 

3 1 region 

No. 6) 35S promoter/Cab leader /ecodap&/Nos 3' 
No. 7) 35S promoter/Cab leader/ cts/ecodaB&/Nos 3 
No. 8) phaseolin 5' region/ecodapA/phaseolin 3* 

25 region 

No. 9) phaseolin 5' region/cts/ecodaj>A/phaseolin 

3' region 

No. 10) 35S promoter/Cab leader/cts/cordapA/Nos 3 
No. 11) phaseolin 5 f region/cts/cordsp&/phaseolin 
30 3' region 

No. 12) KTI3 5' region/cts/cordSEA/KTI3 3' region 
No. 13) HH534 5' region/mcts/l^-M4/HH2-l 3 1 
region 

No. 14) HH534 5 1 region/met s/ecodap£/HH2-l 3' 
35 region 
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A 1440 bp Nco I-Hpa I fragment containing the 
entire lysC coding region plus about 90 bp of 3' non- 
coding sequence was isolated from an agarose- gel 
following electrophoresis and inserted into the leaf 
expression cassette digested with Nco I and Sma I 
(chimeric gene No. 1), yielding plasmid pBT483. 

Oligonucleotides SEQ ID NO: 8 and SEQ ID NO: 9, which 
encode the carboxy terminal part of the chloroplast 
targeting signal, were annealed, resulting in Nco I 
compatible ends, purified via polyacrylaraide gel 
electrophoresis, and inserted into Nco I digested 
pBT4 61. The insertion of the correct sequence in the 
correct orientation was verified by DNA sequencing 
yielding pBT496. Oligonucleotides SEQ ID NO: 10 and SEQ 
ID NO: 11, which encode the amino terminal part of the 
chloroplast targeting signal, were annealed, resulting 
in Nco I compatible ends, purified via polyacrylamide 
gel electrophoresis, and inserted into Nco I digested 
pBT496. The insertion of the correct sequence in the 
correct orientation was verified by DNA sequencing 
yielding pBT521. Thus the cts was fused to the lysC 
gene . 

To fuse the cts to the lys£-M4 gene, pBT521 was 
digested with Sal I, and an approximately 900 bp DNA 
fragment that included the cts and the amino terminal 
coding region of lysC was isolated. This fragment was 
inserted into Sal I digested pBT492, effectively 
replacing the amino terminal coding region of lysC- M4 
with the fused cts and the amino terminal coding region 
of Since the mutation that resulted in lysine- 

insensitivity was not in the replaced fragment, the new 
plasmid, pBT523, carried the cts fused to lys£-M4. 

The 1600 bp Nco I-Hpa I fragment containing the cts 
fused to lysC plus about 90 bp of 3' non-coding sequence 
was isolated and inserted into the leaf expression 
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cassette digested with Nco I and Sma I (chimeric gene 
No. 2) , yielding plasmid pBT541 and the seed-specific 
expression cassette digested with Nco I and Sma I 
(chimeric gene No. 4), yielding plasmid pBT543. 
5 Similarly, the 1600 bp Nco I-Hpa I fragment 

containing the cts fused to lysC- M4 plus about 90 bp of 
3 r non-coding sequence was isolated and inserted into 
the leaf expression cassette digested with Nco I and 
Sma I (chimeric gene No. 3), yielding plasmid pBT540 and 
10 the seed-specific expression cassette digested with 

Nco I and Sma I (chimeric gene No. 5> r yielding plasmid 
pBT544. 

Before insertion into the expression cassettes, the 
ec odapA gene was modified to insert a restriction 
15 endonuclease site, Kpn I, just after the translation 

stop codon . The oligonucleotides SEQ ID NOS : 12-13 were 
synthesized for this purpose: 

SEQ ID NO: 12: 
20 CCGGTTTGCT GTAATAGGTA CCA 

SEQ ID NO: 13: 

AGCTTGGTAC CTATTACAGC AAACCGGCAT G 

25 Oligonucleotides SEQ ID NO: 12 and SEQ ID NO: 13 were 

annealed, resulting in an Sph I compatible end on one 
end and a Hind III compatible end on the other and 
inserted into Sph I plus Hind III digested pBT437. The 
insertion of the correct sequence was verified by DNA 

30 sequencing yielding pBT443. 

An 880 bp Nco I-Kpn I fragment from pBT443 
containing the entire ecodsE& coding region was isolated 
from an agarose gel following electrophoresis and 
inserted into the leaf expression cassette digested with 

35 Nco I and Kpn I (chimeric gene No. 6), yielding plasmid 
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pBT450 and into the seed-specific expression cassette 
digested with Nco I and Kpn I (chimeric gene No. 8), 
yielding plasmid pBT4 94. 

Oligonucleotides SEQ ID NO:8 and SEQ ID NO: 9, which 
5 encode the carboxy terminal part of the chloroplast 
targeting signal, were annealed resulting in Nco I 
compatible ends, purified via polyacrylamide gel 
electrophoresis, and inserted into Nco I digested 
pBT450. The insertion of the correct sequence in the 

10 correct orientation was verified by DNA sequencing 
yielding pBT451. A 950 bp Nco I-Kpn I fragment from 
pBT451 encoding the carboxy terminal part of the 
chloroplast targeting signal fused to the entire ec odapA 
coding region was isolated from an agarose gel following 

15 electrophoresis and inserted into the seed-specific 
expression cassette digested with Nco I and Kpn I, 
yielding plasmid pBT4 95. Oligonucleotides SEQ ID NO: 10: 
and SEQ ID NO: 11:, which encode the amino terminal part 
of the chloroplast targeting signal, were annealed 

20 resulting in Nco I compatible ends, purified via 

polyacrylamide gel electrophoresis, and inserted into 
Nco I digested pBT451 and pBT495. Insertion of the 
correct sequence in the correct orientation was verified 
by DNA sequencing yielding pBT455 and pBT520, 

25 respectively. Thus the cts was fused to the eco dapft 

gene in the leaf expression cassette (chimeric gene No. 
7) and the seed-specific expression cassette (chimeric 
gene No. 9) . 

An 870 bp Nco I-EcoR I fragment from pFS766 

30 containing the entire cordap& coding region was isolated 
from an agarose gel following electrophoresis and 
inserted into the leaf expression cassette digested with 
Nco I and EcoR I, yielding plasmid pFS789 To attach 
the cts to the cordage gene a DNA fragment containing 
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the entire cts was prepared using PCR. The template DNA 
was pBT540 and the oligonucleotide primers used were: 

SEQ ID NO: 14: 
5 GCTTCCTCAA TGATCTCCTC CCCAGCT 

SEQ ID NO: 15: 

CATTGTACTC TTCCACCGTT GCTAGCAA 

10 PCR was performed using a Perkin-Elmer Cetus kit 

according to the instructions of the vendor on a 
thermocycler manufactured by the same company. The 
PCR-generated 160 bp fragment was treated with T4 DNA 
polymerase in the presence of the 4 deoxyribonucleotide 

15 triphosphates to obtain a blunt-ended fragment. The cts 
fragment was inserted into pFS789 which had been 
digested with Nco I and treated with the Klenow fragment 
of DNA polymerase to fill in the 5 1 overhangs . The 
inserted fragment and the vector/insert junctions were 

20 determined to be correct by DNA sequencing, yielding 
pFS846 containing chimeric gene No. 10. 

A 1030 bp Nco I-Kpn I fragment from pFS846 
containing the cts attached to the cor^iapA coding region 
was isolated from an agarose gel following electro- 

25 phoresis and inserted into the phaseolin seed expression 
cassette digested with Nco I and Kpn I, yielding plasmid 
pFS889 containing chimeric gene No. 11. Similar ly, the 
1030 bp Nco I-Kpn I fragment from pFS846 was inserted 
into the KTI3 seed expression cassette digested with 

30 Nco I and Kpn I, yielding plasmid pFS862 containing 
chimeric gene No. 12. 

Oligonucleotides SEQ ID NO: 17 and SEQ ID NO: 18, 
which encode the carboxy terminal part of the corn 
chloroplast targeting signal, were annealed, resulting 

35 in Xba I and Nco I compatible ends, purified via 
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polyacrylamide gel electrophoresis, and inserted into 
Xba I plus Nco I digested pBT4 92 (see Example 2) . The 
insertion of the correct sequence was verified by DNA 
sequencing yielding pBT556. Oligonucleotides SEQ ID 
5 NO: 19 and SEQ ID NO:20, which encode the middle part of 
the chloroplast targeting signal, were annealed, 
resulting in Bgl II and Xba I compatible ends, purified 
via polyacrylamide gel electrophoresis, and inserted 
into Bgl II and Xba I digested pBT556. The insertion of 

10 the correct sequence was verified by DNA sequencing 

yielding pBT557 . Oligonucleotides SEQ ID NO: 21 and SEQ 
ID NO: 22, which encode the amino terminal part of the 
chloroplast targeting signal, were annealed, resulting 
in Nco I and Afl II compatible ends, purified via 

15 polyacrylamide gel electrophoresis, and inserted into 
Nco I and Afl II digested pBT557. The insertion of the 
correct sequence was verified by DNA sequencing yielding 
pBT558. Thus the met s was fused to the lysO M4 gene. 
A 1.6 kb Nco I-Hpa I fragment from pBT558 

20 containing the mcts attached to the lysC -M4 gene was 
isolated from an agarose gel following electrophoresis 
and inserted into the constitutive corn expression 
cassette digested with Nco I and Sma I, yielding plasmid 
pBT573 containing chimeric gene No. 13. 

25 To attach the mcts to the ec odapA gene a DNA 

fragment containing the entire mcts was prepared using 
PCR as described above. The template DNA was pBT558 and 
the oligonucleotide primers used were: 

30 SEQ ID NO: 23: 

GCGCCCACCG TGATGA 

SEQ ID NO: 24: 
CACCGGATTC TTCCGC 
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The mcts fragment was inserted into pBT450 (above) 
which had been digested with Nco I and treated with the 
Klenow fragment of DNA polymerase to fill in the 5' 
overhangs. The inserted fragment and the vector/insert 
5 junctions were determined to be correct by DNA 
sequencing, yielding pBT576. Plasmid pBT576 was 
digested with Asp718, treated with the Klenow fragment 
of DNA polymerase to yield a blunt-ended fragment f and 
then digested with Nco I. The resulting 1030 bp 

10 Nco I-blunt-ended fragment containing the ecodaE& 9 ene 
attached to the mcts was isolated from an agarose gel 
following electrophoresis. This fragment was inserted 
into the constitutive corn expression cassette digested 
with Bgl II, treated with the Klenow fragment of DNA 

15 polymerase to yield a blunt-ended fragment, and then 

digested with Nco I, yielding plasmid pBT583 containing 
chimeric gene No. 14. 

EXAMPLE 7 
Transformation of Tobacco with the 
20 35S Promo ter/lysC Chimeric Genes 

Transformation of tobacco with the 35S 
promoter/ lysC chimeric genes was effected according to 
the following: 

The 35S promoter/Cab leader/l£££/Nos 3', 35S 
25 promoter/Cab leader/cts/Iys£/Nos 3 f , and 35S 

promoter/Cab Ieader/cts/lys£-M4/Nos 3 1 chimeric genes 
were isolated as 3.5-3.6 kb BamH I-EcoR I fragments and 
inserted into BamH I-EcoR I digested vector pZS97K 
(Figure 2), yielding plasmids pBT497 r pBT545 and pBT542, 
30 respectively. The vector is part of a binary Ti plasmid 
vector system [Bevan, (1984) Nucl. Acids. Res. 
12:8711-8720] of Aarobacterium tumef aciens - The vector 
contains: (1) the chimeric gene nopaline synthase 
promoter/neomycin phosphotransferase coding region 
35 (nos:NPT II) as a selectable marker for transformed 
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plant cells [Bevan et al. (1983) Nature 304:184-186]; 

(2) the left and right borders of the T-DNA of the Ti 
plasmid [Bevan (1984) Nucl. Acids. Res. 12:8711-8720]; 

(3) the £. coli lacZ a-coraplementing segment [Vieria and 

5 Messing (1982) Gene 19:259-267] with unique restriction 
endonuclease sites for EcoR I, Kpn I f BamH I and Sal I; 

(4) the bacterial replication origin from the 
PseudonionflS plasmid pVSl [Itoh et al. (1984) Plasmid 
11:206-220]; and (5) the bacterial neomycin 

10 phosphotransferase gene from Tn5 [Berg et al. (1975) 
Proc. Natl. Acad Sci . U.S.A. 72:3628-3632] as a 
selectable marker for transformed £. tumefaciens . 

The 35S promoter/Cab leader /cts/ lysC /Nos 3 f , and 
35S promoter/Cab leader/cts/ lysC- M4/Nos 3' chimeric 

15 genes were also inserted into the binary vector pBT456, 
yielding pBT547 and pBT54 6, respectively. This vector 
is pZS97K, into which the chimeric gene 35S promoter/Cab 
leader/cts/dacA/Nos 3 1 had previously been inserted as a 
BamH I-Sal I fragment (see Example 9) . In the cloning 

20 process large deletions of the dapA chimeric gene 
occurred. As a consequence these plasmids are 
equivalent to pBT545 and pBT542, in that the only 
transgene expressed in plants (other than the selectable 
marker gene, NPT II) was 35S promoter /Cab 

25 leader/cts/l£s£/Nos 3' or 35S promoter/Cab 
leader/cts/lys£-M4/Nos 3". 

The binary vectors containing the chimeric lysC 
genes were transferred by tri-parental matings [Ruvkin 
et al. (1981) Nature 289:85-88] to Aarobacterium strain 

30 LBA4404/pAL4404 [Hockema et al (1983), Nature 

303:179-180]. The Aarobacterium transf ormants were used 
to inoculate tobacco leaf disks [Horsch et al. (1985) 
Science 227:1229-1231]. Transgenic plants were 
regenerated in selective medium containing kanamycin. 
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To assay for expression of the chimeric genes in 
leaves of the transformed plants,- protein was extracted 
as follows. Approximately 2.5 g of young plant leaves , 
with the midrib removed, were placed in a dounce 
5 homogenizer with 0.2 g of polyvinyl polypyrrolidone and 
11 mL of 50mM Tris-HCl pH8.0, 50mM NaCI, ImM EDTA (TNE) 
and ground thoroughly. The suspension was further 
homogenized by a 20 sec treatment with a Brinkman 
Polytron Homogenizer operated at setting 7. The 

10 resultant suspensions were centrifuged at 16,000 rpm for 
20 min at 4°C in a Dupont-Sorvall superspeed centrifuge 
using an SS34 rotor to remove particulates . The 
supernatant was decanted, the volume was adjusted to be 
10 mL by addition of TNE if necessary, and 8 raL of cold, 

15 saturated ammonium sulfate was added. The mixture was 
set on ice for 30 min and centrifuged as described 
above. The supernatant was decanted and the pellet, 
which contained the AKIII protein, was resuspended in 
1 mL of TNE and desalted by passage over a Sephadex G-25 

20 M column (Column PD-10, Pharmacia) . 

For immunological characterization, three volumes 
of extract were mixed with 1 volume of 4 X SDS-gel 
sample buffer (0.17M Tris-HCl pH6.8, 6.7% SDS, 16.7% 
[5-mercaptoethanol, 33% glycerol) and 3 [XL from each 

25 extract were run per lane on an SDS poly aery lamide gel f 
with bacterially produced AKIII serving as a size 
standard and protein extracted from untransf ormed 
tobacco leaves serving as a negative control. The 
proteins were then electrophoretically blotted onto a 

30 nitrocellulose membrane (Western Blot) . The membranes 
were exposed to the AKIII antibodies prepared as 
described in Example 2 at a 1:5000 dilution of the 
rabbit serum using standard protocol provided by BioRad 
with their Immun-Blot Kit. Following rinsing to remove 

35 unbound primary antibody, the membranes were exposed to 
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the secondary antibody, donkey anti-rabbit Ig conjugated 
to horseradish peroxidase (Amersham) at a 1:3000 
dilution. Following rinsing to remove unbound secondary 
antibody, the membranes were exposed to Amersham 
5 chemilurainescence reagent and X-ray film. 

Seven of thirteen transformants containing the 
chimeric gene, 35S promoter/Cab leader /cts/Ixs£-M4/Nos 
3 ' , and thirteen of seventeen transformants containing 
the chimeric gene, 35S promoter/Cab leader/cts/ lysC /Nos 

10 3 T , produced AKIII protein (Table 2). In all cases 
protein which reacted with the AKIII antibody was of 
several sizes. Approximately equal quantities of 
proteins equal in size to AKIII produced in £. coli . and 
a protein about 6 kd larger were evident in all samples, 

15 suggesting that the chloroplast targeting signal had 

been removed from about half of the protein synthesized. 
This further suggests that about half of the the protein 
entered the chloroplast. In addition, a considerable 
amount of protein of higher molecular weight was 

20 observed. The origin of this protein is unclear; the 
total amount present was equal or slightly greater than 
the amounts of the mature and putative AKIII precursor 
proteins combined. 

The leaf extracts were assayed for AK activity as 

25 described in Example 2. AKIII could be distinguished 
from endogenous AK activity, if it were present, by its 
increased resistance to lysine plus threonine. 
Unfortunately, however, this assay was not sensitive 
enough to reliably detect AKIII activity in these 

30 extracts. Zero of four transformants containing the 
chimeric gene, 35S promoter/Cab leader/lys£/Nos 3', 
showed AKIII activity. Only one extract, from a 
transformant containing the 35S promoter/Cab 
leader/cts/J^££-M4/Nos 3 f gene, produced a convincing 

35 level of enzyme activity. This came from transformant 
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546-49A, and was also the extract that showed the 
highest level of AKIII-M4 protein via Western blot. 

An alternative method to detect the expression of 
active AKIII enzyme was to evaluate the sensitivity or 
5 resistance of leaf tissue to high concentrations of 
lysine plus threonine. Growth of cell cultures and 
seedlings of many plants is inhibited by high 
concentrations of lysine plus threonine; this is 
reversed by addition of methionine (or homoserine which 

10 is converted to methionine in vivo) . Lysine plus 

threonine inhibition is thought to result from feedback 
inhibition of endogenous AK, which reduces flux through 
the pathway leading to starvation for methionine. In 
tobacco there are two AK enzymes in leaves, one lysine- 

15 sensitive and one threonine sensitive [Negrutui et al. 
(1984) Theor. Appl. Genet. 68:11-20]. High 
concentrations of lysine plus threonine inhibit growth 
of shoots from tobacco leaf disks and inhibition is 
reversed by addition of low concentrations of 

20 methionine. Thus, growth inhibition is presumably due 
to inhibition of the two AK isozymes. 

Expression of active lysine and threonine 
insensitive AKIII-M4 would be predicted to reverse the 
growth inhibition. As can be seen in Table 2, this was 

25 observed. There is f in fact r a good correlation between 
the level of AKIII-M4 protein expressed and the 
resistance to lysine plus threonine inhibition. 
Expression of lysine-sensitive wild type AKIII does not 
have a similar effect. Only the highest expressing 

30 transformant showed any resistance to lysine plus 

threonine inhibition, and this was much less dramatic 
than that observed with AKIII-M4. 

To measure free amino acid composition of the 
leaves, free amino acids were extracted as follows. 

35 Approximately 30-40 mg of young leaf tissue was chopped 
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with a razor and dropped into 0.6 mL of methanol/ 
chloroform/water mixed in ratio of 12v/5v/3v (MCW) on 
dry ice. After 10-30 min the suspensions were brought 
to room temperature and homogenized with an Omni 1000 
5 Handheld Rechargeable Homogenizer and then centrifuged 
in an eppendorf microcentrifuge for 3 min. 
Approximately 0.6mL of supernatant was decanted and an 
additional 0.2mL of MCW was added to the pellet which 
was then vortexed and centrifuged as above. The second 

10 supernatant, about 0.2mL, was added to the first. To 

this, 0.2mL of chloroform was added followed by 0.3mL of 
water. The mixture was vortexed and the centrifuged in 
an eppendorf microcentrifuge for about 3 min, the upper 
aqueous phase, approximately l.OmL, was removed, and was 

15 dried down in a Savant Speed Vac Concentrator. 

One-tenth of the sample was run on a Beckman Model 6300 
amino acid analyzer using post-column ninhydrin 
detection. Relative free amino acid levels in the 
leaves were compared as ratios of lysine or threonine to 

20 leucine, thus using leucine as an internal standard. 

There was no consistent effect of expression of AKIII or 
AKIII-M4 on the lysine or threonine (or any other amino 
acid) levels in the leaves (Table 2) . 

TABLE 2 

BT542 transf ormants : 35S promoter/Cab leader/cts/lya£-M4/Nos 3 1 
BT545 transf ormants: 35S promoter/Cab leader/ cts/lyaC/Nos 3* 

BT546 transf ormants: 35S promoter/Cab leader /cts / lyaC -M4/Nos 3' 
BT547 transf ormants: 35S promoter/Cab leader / cts/ ly sC /No a 3' 





FREE 


AMINO 


AKIII 




RESISTANCE 




ACIDS /LEAF 


ACTIVITY 


WESTERN 


TO Lys 3mM 


LINE 


K/L 


T/L 


U/MG/HR 


BLOT 


+ Thr 3mM 


542-5B 


0.5 


3.5 


0 






542-2 6A 


0.5 


3.3 


0 






542-27B 


0.5 


3.4 


0 


++ 


+++ 


542-35A 


0.5 


4.3 


0.01 






542-54A 


0.5 


2.8 


0 






542-57B 


0.5 


3.4 


0 




+ 
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LINE 


FREE AMINO 
ACIDS /LEAF 
K/L T/L 


AKIII 
ACTIVITY 

\j i no f nrv 


WESTERN 
BLOT 


RESISTANCE 
TO Lys 3mM 
+ Thr 3mM 


545-5A 


n.d. 


n.d. 


0.02 


++ 




545-7B 


0.5 


3.4 


0 


+ 




545-17B 


0.6 


2.5 


0.01 


+ 




545-27A 


0.6 


3.5 


0 


++ 




545-50E 


0.6 


3.6 


0.03 


++ 




545-52A 


U . D 




0.02 


- 




546-4A 


0.4 


4.5 


0 


+ 


+ 


546-24B 


0.6 


4.9 


0.04 


++ 


++ 


546-44A 


G.5 


O .U 


0.03 




++ 


546-49A 


0.7 


7.0 


0.10 


+++ 


+++ 


546-54A 


0.5 


6.4 


0 


+ 


+ 


546-56B 


0.5 


4.4 


0.01 






546-58B 


0.6 


8.0 


0 


+ 


•H- 


547-3D 


0.4 


5.4 


0 


++ 


- 


547-8B 


0.6 


5.0 


0.02 


- 




547-9A 


0.5 


4.3 


0.03 


+++ 




547-12A 


0.7 


3.9 


0 


+++ 


+ 


547-15B 


0.6 


4.5 


0 


+ 




547-16A 


0.5 


3.6 


0 






547-18A 


0.5 


4.0 




+++ 




547-22A 


0.8 


4.4 








547-25C 


0.5 


4.3 




+ 




547-28C 


0.6 


5.6 








547-29C 


0.5 


3.8 




+++ 
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EXAMPLE 9 
.Transformation of Tobacco, with the 
Phaseolin Promoter/lysC Chimeric Genes 
The phaseolin promoter/JL^fiC chimeric gene 
5 cassettes, phaseolin 5' region/cts/JLysC/phaseolin 3' 

region, and phaseolin 5* region/cts/iys£-M4 /phased in 3 f 
region (Example 6) were isolated as approximately 3.3 kb 
Hind III fragments. These fragments were inserted into 
the unique Hind III site of the binary vector pZS97 
10 (Figure 3) yielding pBT548 and pBT549, respectively. 

This vector is similar to pZS97K described in Example 7 
except for the presence of two additional unique cloning 
sites, Sma I and Hind III, and the bacterial (3-lactamase 

gene (causing ampicillin resistance) as a selectable 
15 marker for transformed tumefaciens instead of the 

bacterial neomycin phosphotransferase gene. 

The binary vectors containing the chimeric lysC 

genes were transferred by tri-parental matings to 

AgrobfrCterimu strain LBA4404/pAL4404, the Aarobacterium 
20 transformants were used to inoculate tobacco leaf disks 

and transgenic plants regenerated by the methods set out 

in Example 7 . 

To assay for expression of the chimeric genes in 

the seeds of the transformed plants, the plants were 
25 allowed to flower, self-pollinate and go to seed. Total 

proteins were extracted from mature seeds as follows. 

Approximately 3Q-40 mg of seeds were put into a 1.5mL 

disposable plastic microfuge tube and ground in 0.25mL 

of 50mM Tris-HCl pH6.8, 2mM EDTA, 1% SDS, 1% |5-mercapto- 

30 ethanol. The grinding was done using a motorized 

grinder with disposable plastic shafts designed to fit 
into the microfuge tube. The resultant suspensions were 
centrifuged for 5 min at room temperature in a microfuge 
to remove particulates. Three volumes of extract was 

35 mixed with 1 volume of 4 X SDS-gel sample buffer (0.17M 
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Tris-HCl pH6.8, 6.7% SDS, 16.7% p-mercaptoethanol, 33% 
glycerol) and 5\lL from each extract were run per lane on 
an SDS polyacrylamide gel, with bacterially produced 
AKIII serving as a size standard and protein extracted 
5 from untransformed tobacco seeds serving as a negative 
control. The proteins were then electrophoretically 
blotted onto a nitrocellulose membrane. The membranes 
were exposed to the AKIII antibodies (prepared as 
described in Example 2) at a 1:5000 dilution of the 

10 rabbit serum using standaxd protocol provided by BioRad 
with their Immun-Blot Kit. Following rinsing to remove 
unbound primary antibody the membranes were exposed to 
the secondary antibody, donkey anti-rabbit Ig conjugated 
to horseradish peroxidase (Amersham) at a 1:3000 

15 dilution. Following rinsing to remove unbound secondary 
antibody, the membranes were exposed to Amersham 
chemiluminescence reagent and X-ray film. 

Ten of eleven transformants containing the chimeric 
gene, phaseolin 5' region/cts/lys£/phaseolin 3* region, 

20 and ten of eleven transformants containing the chimeric 
gene, phaseolin 5' region/cts/lys£-M4/phaseolin 3 f 
region, produced AKIII protein (Table 3) . In all cases 
protein which reacted with the AKIII antibody was of 
several sizes. Approximately equal quantities of 

25 proteins equal in size to AKIII produced in solL, and 
about 6 kd larger were evident in all samples, 
suggesting that the chloroplast targeting signal had 
been removed from about half of the protein synthesized. 
This further suggests that about half of the the protein 

30 entered the chloroplast. In addition, some proteins of 
lower molecular weight were observed, probably 
representing breakdown products of the AKIII 
polypeptide. 

To measure free amino acid composition of the 

35 seeds, free amino acids were extracted from mature seeds 
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as follows. Approximately 30-40 mg of seeds and an 
approximately equal amount of sterilized sand were put 
into a 1.5mL disposable plastic microfuge tube along 
with 0.2mL of methanol/chlorof orm/water mixed in ratio 
5 of 12v/5v/3v (MCW) at room temperature. The seeds were 
ground using a motorized grinder with disposable plastic 
shafts designed to fit into the microfuge tube. After 
grinding an additional 0.5mL of MCW was added, the 
mixture was vortexed and then centrifuged in an 

10 eppendorf microcentrifuge for about 3 min. 

Approximately 0.6mL of supernatant was decanted and an 
additional 0.2mL of MCW was added to the pellet which 
was then vortexed and centrifuged as above. The second 
supernatant, about 0.2mL, was added to the first. To 

15 this, 0.2mL of chloroform was added followed by 0.3mL of 
water. The mixture was vortexed and then centrifuged in 
an eppendorf microcentrifuge for about 3 min, the upper 
aqueous phase, approximately l.OmL, was removed, and was 
dried down in a Savant Speed Vac Concentrator. The 

20 samples were hydrolyzed in 6N hydrochloric acid, 0.4% 
p-mercaptoethanol under nitrogen for 24 h at 1 10-12 0°C; 

1/4 of the sample was run on a Beckman Model 6300 amino 
acid analyzer using post-column ninhydrin detection. 
Relative free amino acid levels in the seeds were 
25 compared as ratios of lysine, methionine, threonine or 
isoleucine to leucine, thus using leucine as an internal 
standard. 

To measure the total amino acid composition of the 
seeds, 6 seeds were hydrolyzed in 6N hydrochloric acid, 
30 0.4% p-mercaptoethanol under nitrogen for 24 h at 

110-120°C; 1/10 of the sample was run on a Beckman Model 
6300 amino acid analyzer using post-column ninhydrin 
detection. Relative amino acid levels in the seeds were 
compared as ratios of lysine, methionine, threonine or 
35 isoleucine to leucine, thus using leucine as an internal 
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standard. Because the transgene was segregating in 
these self-pollinated progeny of the primary 
transformant and only six seeds were analyzed, there was 
expected to be some sampling error. Therefore, the 
5 measurement was repeated multiple times for some of the 
lines (Table 3) . 

Expression of the cts/lyaQ gene in the seeds 
resulted in a 2 to 4-fold increase in the level of free 
threonine in the seeds and a 2 to 3-fold increase in the 

10 level of free lysine in some cases. There was a good 
correlation between transformants expressing higher 
levels of AKIII protein and those having higher levels 
of free threonine, but this was not the case for lysine. 
These relatively small increases of free threonine or 

15 lysine were not sufficient to yield detectable increases 
in the levels of total threonine or lysine in the seeds. 
Expression of the cts/lys£-M4 gene in the seeds resulted 
in a 4 to 23-fold increase in the level of free 
threonine in the seeds and a 2 to 3-fold increase in the 

20 level of free lysine in some cases. There was a good 
correlation between transformants expressing higher 
levels of AKIII protein and those having higher levels 
of free threonine, but this was again not the case for 
lysine. The larger increases of free threonine were 

25 sufficient to yield detectable increases in the levels 
of total threonine in the seeds. Sixteen to twenty-five 
percent increases in total threonine content of the 
seeds were observed in three lines which were sampled 
multiple times. (Isoleucine to leucine ratios are shown 

30 for comparison.) The lines that showed increased total 
threonine were the same ones the showed the highest 
levels of increase in free threonine and high expression 
of the AKIII-M4 protein. From these results it can be 
estimated that free threonine represents about 1% of the 

35 total threonine present in a normal tobacco seed, but 
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about 18% of the total threonine present in seeds 
expressing high levels of AKIII-M4. 



TABLE 3 

BT548 Transf ormants : phaseolin 5* region/cts/Jjtafi/phaseolin 3' 
BT549 Transf ormants : phaseolin 5' region/cts/xxa£-M4/phaseolin 3' 



LINE 


SEED 
FREE 
K/L 


AMINO 
T/L 


ACID 
I/L 

X f J-i 


SEED 

TOTAL 

K/L 


AMINO 
T/T, 

X f u 


ACID 

T/L 

X/ XI 


MP QTPDXT 
rtLtO lEdi\N 


NORMAL 


0 4 9 


1 ^4 


v • DO 




n fin 

U • DO 


ft 

U • DO 




548-2A 


1.15 


2.3 


0.78 


0.43 


0.71 


0.67 


+ 


548-4D 


0. 69 


5 .3 


o ftn 

v i OU 


ft "35 


ft £Q 


ft 6 s ; 
u . oo 


-i- -i i 


548-6A 


0.39 


3.5 


0 85 




ft 

u . o y 


ft Ail 


T 


548-7A 


0. 82 


4.2 


0 83 


Ui JO 


ft fifl 

V ■ DO 


ft 

U . DO 


TT 


548-14A 


0 41 

W • "*X 


J * X 


ft ft? 

V • 0<£ 




ft £7 


ft £R 
U • DO 


+ 


548-18A 


v • J X 


X.J 


ft fiQ 


ft ^7 


ft an 

U . D / 


ft 

U • bo 




548-22A 


1. 41 


2.9 


0.75 


0 . 47 


ft 74 


ft £5 
V • DO 


tit 


54 8-2 4 A 


0.73 


3.7 


0.81 


0.38 


0 . 68 


0 . 65 


X.X 


548-41A 


0.40 


2.8 


0 .77 


0 .37 


0 . 68 




X, 


548-50A 


0.46 


4.0 


0.81 


0 .33 


0 . 68 


ft 65 

U • OJ 


X, 


548-57A 


0.50 


3.8 


0.80 


0.33 


0 . 67 


V . OJ 


++ 


549-5A 


0.63 


5.9 


0.69 


0.32 


0.65 


0.65 


+ 


549-7A 


0.51 


8.3 


0.78 


0.33 


0.67 


0.63 


++ 


5 4 9-2 OA 


0.67 


30 


0.88 


0.38* 


0.82* 


0.65* 


++++ 


549-34A 


0.43 


1.3 


0.69 


0.32 


0.64 


0.63 




549-39D 


0.83 


16 


0.83 


0,35 


0.71 


0.63 


+++ 


549-40A 


0.80 


4.9 


0.74 


0.33 


0.63 


0.64 


+ 


549-41C 


0.99 


13 


0.80 


0.38* 


0.79* 


0.65* 


+++ 


549-46A 


0.48 


7.7 


0.84 


0.34 


0.70 


0.64 


+ 


549-52A 


0.81 


9.2 


0.80 


0.39 


0.70 


0.65 


++ 


549-57A 


0.60 


15 


0.77 


0.35* 


0.85* 


0.64* 


+++ 


549-60D 


0.85 


11 


0.79 


0.37 


0.73 


0.65 


++ 



NORMAL WAS CALCULATED AS THE AVERAGE OF 6 SAMPLES FOR FREE AMINO 
ACID AND 23 SAMPLES FOR TOTAL AMINO ACIDS. 
* INDICATES AVERAGE OF AT LEAST 5 SAMPLES 
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Seeds derived from self-pollination of two plants 
transformed with the phaseolin 5' region/cts/i^s£I- 
M4/phaseolin 3' region, plants 549-5A and 549-40A, 
showed 3 kanamycin resistant to 1 kanamycin sensitive 
5 seedlings , indicative of a single site of insertion of 
the transgene. Progeny plants were grown, self- 
pollinated and seed was analyzed for segregation of the 
kanamycin marker gene. Progeny plants that were 
homozygous for the transgene insert, thus containing two 

10 copies of the gene cassette, accumulated approximately 2 
times as much threonine in their seed as their sibling 
heterozygous progeny with one copy of the gene cassette 
and about 8 times as much as seed without the gene. 
This demonstrates that the level of expression of the 

15 £. coli enzyme controls the accumulation of free 
threonine . 

E XAMPL E 9 
Transformation of Tobacco with the 

35S Promoter/ecodapA Chimeric Genes 

20 The 35S promoter/Cab leader /ecodagA/Nos 3* and 35S 

promoter/Cab leader/cts/ecodaE^/Nos 3', chimeric genes 
were isolated as 3.1, and 3.3 kb BamH I -Sal I fragments, 
respectively and inserted into BamH I -Sal I digested 
binary vector pZS97K (Figure 2) , yielding plasmids 

25 pBT462 and pBT463, respectively. The binary vector is 
described in Example 7 . 

The binary vectors containing the chimeric ec odapA 
genes were transferred by tri-parental matings to 
Acrrobacterium strain LBA4404/pAL4404, the Agrofracterium 

30 transformants used to inoculate tobacco leaf disks and 
the resulting transgenic plants regenerated by the 
methods set out in Example 7. 

To assay for expression of the chimeric genes in 
leaves of the transformed plants, protein was extracted 

35 as described in Example 7, with the following 
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modifications. The supernatant from the first ammonium 
sulfate precipitation, approximately 18mL, was mixed 
with an additional 12mL of cold, saturated ammonium 
sulfate. The mixture was set on ice for 30 min and 
5 centrifuged as described in Example 7 . The supernatant 
was decanted and the pellet, which contained the DHDPS 
protein, was resuspended in 1 mL of TNE and desalted by 
passage over a Sephadex G-25 M column (Column PD-10, 
Pharmacia) . 

10 The leaf extracts were assayed for DHDPS activity 

as described in Example 4. £. coli DHDPS could be 
distinguished from tobacco DHDPS activity by its 
increased resistance to lysine; E. coli DHDPS retained 
80-90% of its activity at O.lmM lysine, while tobacco 

15 DHDPS was completely inhibited at that concentration of 
lysine. One of ten transformants containing the 
chimeric gene, 35S promoter/Cab leader/eco dapA /Nos 3', 
showed £. coli DHDPS expression, while five of ten 
transformants containing the chimeric gene, 35S 

20 promoter/Cab leader/cts/ecodapA/Nos 3 l showed E. coli 
DHDPS expression. 

Free amino acids were extracted from leaves as 
described in Example 7. Expression of the chimeric 
gene, 35S promoter/Cab leader/cts/ecodap£/Nos 3 1 , but 

25 not 35S promoter/Cab leader/ecodacA/Nos 3' resulted in 
substantial increases in the level of free lysine in the 
leaves. Free lysine levels from two to 90-fold higher 
than untransformed tobacco were observed. 

The transformed plants were allowed to flower, 

30 self -pollinate and go to seed. Seeds from several lines 
transformed with the 35S promoter/Cab leader/ 
cts/ecodapA/Nos 3' gene were surface sterilized and 
germinated on agar plates in the presence of kanamycin. 
Lines that showed 3 kanamycin resistant to 1 kanamycin 

35 sensitive seedlings, indicative of a single site of 
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insertion of the transgenes, were identified. Progeny 
that were homozygous for the transgene insert were 
obtained from these lines using standard genetic 
analysis. The homozygous progeny were then 
5 characterized for expression of £. coli DHDPS in young 
and mature leaves and for the levels of free amino acids 
accumulated in young and mature leaves and in mature 
seeds . 

Expression of active £. coli DHDPS enzyme was 

10 clearly evident in both young and mature leaves of the 
homozygous progeny of the transformants (Table 4) . High 
levels of free lysine, 50 to 100-fold higher than normal 
tobacco plants, accumulated in the young leaves of the 
plants, but a much smaller accumulation of free lysine 

15 (2 to 8-fold) was seen in the larger leaves. 

Experiments that measure lysine in the phloem suggest 
that lysine is exported from the large leaves. This 
exported lysine may contribute to the accumulation of 
lysine in the small growing leaves, which are known to 

20 take up, rather than export nutrients. Since the larger 
leaves make up the major portion of the biomass of the 
plant, the total increased accumulation of lysine in the 
plant is more influenced by the level of lysine in the 
larger leaves. No effect on the free lysine levels in 

25 the seeds of these plants was observed (Table 4) . 



TABLE 4 

Progeny of BT463 transformants homozygous for 
35S promoter/Cab leader/cts/ecodasA/Nos 3 f 







LEAF 




£. COLT 


SEED FREE 




LEAF 


FREE 


AMINO ACID 


DHDPS 


AMINO ACID 


LINE 


SIZE 


K/L 


K/TOT 


OD/60Vmcr 


K/L 


NORMAL 


3 in. 


0.5 


0.006 


0 


0.5 


463-18C-2 


3 in. 


47 


0.41 


7.6 


0.4 


463-18C-2 


12 in. 


1 


0.02 


5.5 




463-25A-4 


3 in. 


58 


0.42 


6.6 


0.4 


463-25A-4 


12 in. 


4 


0.02 


12.2 
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LEAF 




£. COLI 


SEED FREE 




LEAF 


FREE 


AMINO ACID 


DHDPS 


AMINO ACID 


LINE 


SIZE 


K/L 


K/TOT 


OD/60 1 /mcr 


K/L 


463-38C-3 


3 in. 


28 


0.28 


6.1 


0.5 


463-38C-3 


12 in. 


2 


0.04 


8.3 





EXAMPLE 10 
Transformation of Tobarnn with i-h* 
Phaseolin Prompt er/erodapA rhimp rin Cfinfifi 

The chimeric gene cassettes, phaseolin 5 1 
5 region/ecodacA/phaseolin 3 1 region, and phaseolin 5' 

region/cts/ecodaca/phaseolin 3 1 region (Example 6) were 
isolated as approximately 2.6 and 2.8 kb Hind III 
fragments, respectively. These fragments were inserted 
into the unique Hind III site of the binary vector pZS97 

10 (Figure 3), yielding pBT506 and pBT534, respectively. 
This vector is described in Example 8. 

The binary vectors * containing the chimeric ecodapA 
genes were transferred by tri-parental matings to 
AgrQbacterium strain LBA4404/pAL4404, the Agrrobanfrerimn 

15 transformants used to inoculate tobacco leaf disks and 
the resulting transgenic plants were regeneratedby the 
methods set out in Example 7 . 

To assay for expression of the chimeric genes, the 
transformed plants were allowed to flower, self- 

20 pollinate and go to seed. Total seed proteins were 

extracted as described in Example 8 and immunologically 
analyzed as described in Example 7, with the following 
modification. The Western blot membranes were exposed 
to the DHDPS antibodies prepared in Example 4 at a 

25 1:5000 dilution of the rabbit serum using standard 

protocol provided by BioRad with their Immun-Blot Kit. 

Thirteen of fourteen transformants containing the 
chimeric gene, phaseolin 5' region/ecodacA/phaseolin 3' 
region and nine of thirteen transformants containing the 
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chimeric gene, phaseolin 5 f region/cts/ecodapA/phaseolin 
3 1 region, produced DHDPS protein detectable via Western 
blotting (Table 3) . Protein which reacted with the 
DHDPS antibody was of several sizes. Most of the 
5 protein was equal in size to DHDPS produced in £&lx, 
whether or not the chimeric gene included the 
chloroplast transit sequence. This indicated that the 
chloroplast targeting signal had been efficiently 
removed from the precursor protein synthesized. This 

10 further suggests the majority of the protein entered the 
chloroplast. In addition, some proteins of lower 
molecular weight were observed, probably representing 
breakdown products of the DHDPS polypeptide. 

To measure free amino acid composition and total 

15 amino acid composition of the seeds, free amino acids 
and total amino acids were extracted from mature seeds 
and analyzed as described in Example 8. Expression of 
either the ecodapA gene or cts/ecodapA had no effect on 
the total lysine or threonine composition of the seeds 

20 in any of the transformed lines (Table 5) . Several of 
the lines that were transformed with the phaseolin 5 1 
region/cts/ecodapA/phaseolin 3 1 chimeric gene were also 
tested for any effect on the free amino acid 
composition. Again, not even a modest effect on the 

25 lysine or threonine composition of the seeds was 
observed in lines expressing high levels of £. 
DHDPS protein (Table 5) . This was a surprising result, 
given the dramatic effect (described in Example 9) that 
expression of this protein has on the free lysine levels 

30 in leaves. 

One possible explanation for this was that the 
DHDPS protein observed via Western blot was not 
functional. To test this hypothesis, total protein 
extracts were prepared from mature seeds and assayed for 

35 DHDPS activity. Approximately 30-40 mg of seeds were 
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put into a 1.5 mL disposable plastic microfuge tube and 
ground in 0.25mL of 50mM Tris-HCl, 50mM NaCl, ImM EDTA 
(TNE) . The grinding was done using a motorized grinder 
with disposable plastic shafts designed to fit into the 
5 microfuge tube. The resultant suspensions were 

centrifuged for 5 min at room temperature in a microfuge 
to remove particulates. Approximately 0.1 mL of aqueous 
supernatant was removed between the pelleted material 
and the upper oil phase. The seed extracts were assayed 

10 for DHDPS activity as described in Example 4. £. coli 
DHDPS could be distinguished from tobacco DHDPS activity 
by its increased resistance to lysine; E. DHDPS 
retained about 50% of its activity at 0.4mM lysine, 
while tobacco DHDPS was completely inhibited at that 

15 concentration of lysine. High levels of £. coli DHDPS 
activity were seen in all four seed extracts tested 
eliminating this explanation. 

The presence of the cts sequence in the chimeric 
ecoda^A gene was essential for eliciting accumulation of 

20 high levels of lysine in leaves. Thus another possible 
explanation was that the cts sequence had somehow been 
lost during the insertion of the chimeric phaseolin 5' 
region/cts/ecodaEA/phaseolin 3' gene into the binary 
vector. PCR analysis of several of the transformed 

25 lines demonstrated the presence of the cts sequence, 
however, ruling out this possibility. 

A third explanation was that amino acids are not 
normally synthesized in seeds, and therefore the other 
enzymes in the pathway were not present in the seeds. 

30 The results of experiments presented in Example 8, 
wherein expression of phaseolin 5' region/cts/lys£- 
M4/phaseolin 3' gene resulted in accumulation of high 
levels of free threonine in seeds, indicate that this is 
not the case. 
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Taken together these results and the results 
presented in Example 9, demonstrate that expression of a 
lysine-insensitive DHDPS in either seeds or leaves is 
not sufficient to achieve accumulation of increased free 
5 lysine in seeds. 

TABLE 5 

BT506 Transf onnants : phaseolin 5 T region/ ecodapA/phaseolin 3' 
BT534 Transf ormants: phaseolin 5* region/cts/ecodac^/phaseolin 3 1 

SEED: FREE SEED: TOTAL E.. COL I 
AMINO ACIDS AMINO ACIDS DHDPS 
LINE K/L K/L T/L OP/gQ'/MG WESTERN 

NORMAL 0.49 1.34 0.35 0.68 

506-2B 0.34 0.66 + 

506-4B 0.33 0.67 + 

506-16A 0.34 0.67 + 

506-17A 0.36 0.55 7.7 +++ 

506-19A 0.37 0.45 ++ 

506-22A 0.34 0.67 ++ 

506-23B 0.35 0.67 ++ 

506-33B 0.34 0.67 ++ 

506-38B 0.36 0.69 8.7 

506-39A 0.37 0.70 

506-40A 0.36 0.68 

506-47A 0.32 0.68 

506-48A 0.33 0.69 

506-49A 0.33 0.69 

534-8A 0.34 0.66 

534-9A 0.36 0.67 

534-22B 0.43 1.32 0.39 0.51 4.9 

534-31A 0.34 0.66 

534-38A 0.35 1.49 0.42 0.33 

534-3 9A 0.38 0.69 

534-7A 0.34 0.67 

534-25B 0.35 0.67 
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LINE 


SEED: FREE 
AMINO ACIDS 
K/L 


SEED: 
AMINO 
K/L 


TOTAL 
ACIDS 
T/L 


£. £QLI 

DHDPS 

0D/60'/MG 


WESTERN 


534-34B 


0.80 


1.13 


0.42 


0.70 






534-35A 


0.43 


1.18 


0.33 


0.67 




+++ 


534-37B 


0.42 


1.58 


0.37 


0.68 






534-43A 






0.35 


0.68 




+++ 


534-48A 


0.46 


1.24 


0.35 


0.68 


6.2 


+++ 



EXAMPLE 11 

Transformation of Tobacco with thP 
5 35S Promoter/cts/dapA Plus 35S Promoter/cts/lyfin-M4 

Chimeric Genet* 
The 35S promoter/Cab leader /cts/ecoda pA /Nos 3 1 , and 
35S promoter/Cab leader/ cts/l¥££-M4/Nos 3 f chimeric 
genes were combined in the binary vector pZS97K 

10 (Figure 2) . The binary vector is described in 

Example 7. An oligonucleotide adaptor was synthesized 
to convert the BamH I site at the 5' end of the 35S 
promoter/Cab leader/cts/l£££-M4/Nos 3 f chimeric gene 
(see Figure 1) to an EcoR I site. The 35S promoter/Cab 

15 leader/cts/l£S£-M4/Nos 3 J chimeric gene was then 

isolated as a 3.6 kb EcoR I fragment from plasmid pBT540 
(Example 6) and inserted into pBT463 (Example 9) 
digested with EcoR I, yielding plasmid pBT564. This 
vector has both the 35S promoter /Cab 

20 leader /cts/ecodacA/Nos 3 1 , and 35S promoter/Cab 

leader/cts/I^aH-M4/Nos 3' chimeric genes inserted in the 
same orientation. 

The binary vector containing the chimeric ecodapA 
and lysC -M4 genes was transferred by tri-parental 

25 matings to AarobacteriMm strain LBA4404/pAL4404, the 
AgrQbarrt.erium transformants used to inoculate tobacco 
leaf disks and the resulting transgenic plants 
regenerated by the methods set out in Example 7. 
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To assay for expression of the chimeric genes in 
leaves of the transformed plants, protein was extracted 
as described in Example 7 for AKIII, and as described in 
Example 9 for DHDPS. The leaf extracts were assayed for 
5 DHDPS activity as described in Examples 4 and 9, 

£. coli DHDPS could be distinguished from tobacco DHDPS 
activity by its increased resistance to lysine; E. soli 
DHDPS retained 80-90% of its activity at O.lmM lysine, 
while tobacco DHDPS was completely inhibited at that 
10 concentration of lysine. Extracts were characterized 
immunologically for expression of AKIII and DHDPS 
proteins via Western blots as described in Examples 7 
and 10. 

Ten of twelve transformants expressed £. coli DHDPS 

15 enzyme activity (Table 6) . There was a good correlation 
between the level of enzyme activity and the amount of 
DHDPS protein detected immunologically. As described in 
Example 7, the AK assay was not sensitive enough to 
detect enzyme activity in these extracts. However, 

20 AKIII-M4 protein was detected immunologically in eight 
of the twelve extracts. In some transformants, 564-21A 
and 47A, there was a large disparity between the level 
of expression of DHDPS and AKIII-M4, but in 10 of 12 
lines there was a good correlation. 

25 Free amino acids were extracted from leaves and 

analyzed for amino acid composition as described in 
Example 7. In the absence of significant AKIII-M4, the 
level of expression of the chimeric gene, 35S 
promoter/Cab leader/ cts/eco£isE&/Nos 3 r determined the 

30 level of lysine accumulation (Table 6) . Compare lines 
564-21A, 47A and 39C, none of which expresses 
significant AKIII-M4. Line 564-21A accumulates about 
10-fold higher levels of lysine than line 564-47A which 
expresses a lower level of £- coli DHDPS and 40-fold 

35 higher levels of lysine than 564-39C which expresses no 
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£. coli DHDPS. However/ in transf ormants that all 
expressed similar amounts of £. coli DHDPS (564-18A, 
56A, 36E f 55B, 47A), the level of expression of the 
chimeric gene, 35S promoter/Cab leader/cts/ lysC -M4/Nos 
5 3' r controlled the level of lysine accumulation. Thus 
it is clear that although expression of 35S promoter/Cab 
leader/cts/l£££-M4/Nos 3' has no effect on the free 
amino acid levels of leaves when expressed alone (see 
Example 7), it can increase lysine accumulation when 
10 expressed in concert with the 35S promoter/Cab 

leader/cts/ecodap^/Nos 3' chimeric gene. Expression of 
these genes together did not effect the level of any 
other free amino acid in the leaves. 

TABLE 6 

BT564 Transf ormants : 35S promoter/Cab leader /cts/ecodapA/Nos 3' 

35S promoter/Cab leader/ cts/Jjis£-M4/Nos 3' 

£. COLI 



LINE 


FREE AA LEAF 
nmol/4mg 


FREE 


AA LEAF 


DHDPS 
U/MG/HR 


WESTERN 
DHDPS 


WESTERN 
AK-III 




TOT 


K 


K/L 


K/TOT 








564-21A 


117 


57 


52 


0.49 


2.4 


+++ 


+/- 


564-18A 


99 


56 


69 


0.57 


1.1 


++ 


++ 


564-56A 


104 


58 


58 


0.56 


1.5 


++ 


++ 


564-36E 


85 


17 


17 


0.20 


1.5 


++ 


+++ 


564-55B 


54 


5 


9.1 


0.10 


1.0 


++ 


+ 


564-47A 


18 


1 


4.8 


0.06 


0.8 


++ 




564-35A 


37 


7 


13 


0.18 


0.3 


+ 


++ 


564-60D 


61 


3 


4.5 


0.06 


0.2 


+ 


++ 


564-45A 


46 


4 


8.1 


0.09 


0.4 


+ 


+ 


564-44B 


50 


1 


1.7 


0.02 


0.1 


+/- 




564-49A 


53 


1 


1.0 


0.02 


0 


+ /- 




564-39C 


62 


1 


1.4 


0.02 


0 







Free amino acids were extracted from mature seeds 
derived from self-pollinated plants and quantitated as 
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described in Example 8. There was no significant 
difference in the free amino acid content of seeds from 
untransformed plants compared to that from the plants 
showing the highest free lysine accumulation in leaves, 
5 i.e. plants 564-18A, 564-21A, 564-36E, 564-56A. 

EXAMPLE 12 
Transformation of Tobacco with the 
PhaRfiolin Promoter /cts/ecodapA plus 
Phaseolj n Promoter/cts/lvsC-M4 Chimeric Genes 

10 The chimeric gene cassettes , phaseolin 5 f 

region/cts/ecodaeA/phaseolin 3' region and phaseolin 5 f 
region/cts/lys£-M4/phaseolin 3 r (Example 6) were 
combined in the binary vector pZS97 (Figure 3) . The 
binary vector is described in Example 8 . To accomplish 

15 this the phaseolin 5 f region/cts/ecodaEA/phaseolin 3 f 

chimeric gene was isolated as a 2.7 kb Hind III fragment 
and inserted into the Hind III site of vector pUC1318 
[Kay et al (1987) Nucleic Acids Res. 6: 2778] , yielding 
pBT568. It was then possible to digest pBT568 with 

20 BamH I and isolate the chimeric gene on a 2.7 kb BamH I 
fragment. This fragment was inserted into BamH I 
digested pBT5 4 9 (Example 8), yielding pBT570 . This 
binary vector has both chimeric genes, phaseolin 5' 
region/cts /eco dapA/ phaseolin 3' gene and phaseolin 5 f 

25 region/cts / lysC- M4 /phaseolin 3 f inserted in the same 
orientation . 

The binary vector pBT570 was transferred by tri- 
parental mating to Aarobacterium strain LBA4404/pAL4404, 
the Agrobacterium transformants used to inoculate 
30 tobacco leaf disks and the resulting transgenic plants 
regenerated by the methods set out in Example 7. 

To assay for expression of the chimeric genes in 
the seeds of the transformed plants, the plants were 
allowed to flower, self -pollinate and go to seed. Total 
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proteins were extracted from mature seeds and analyzed 
via western blots as described in Example 8, 

Twenty-one of twenty-five transformants expressed 
the DHDPS protein and nineteen of these also expressed 
5 the AKIII protein (Table 7) . The amounts of the 

proteins expressed were related to the number of gene 
copies present in the transformants; the highest 
expressing lines, 570-4B, 570-12C, 570-59B and 570-23B, 
all had two or more sites of insertion of the gene 

10 cassette based on segregation of the kanamycin marker 
gene, Enzymatically active E. coli DHDPS was observed 
in mature seeds of all the lines tested wherein the 
protein was detected. 

To measure free amino acid composition of the 

15 seeds, free amino acids were extracted from mature seeds 
and analyzed as described in Example 8. There was a 
good correlation between transformants expressing higher 
levels of both DHDPS and AKIII protein and those having 
higher levels of free lysine and threonine. The highest 

20. expressing lines (marked by asterisk in Table 7) showed 
up to a 2-fold increase in free lysine levels and up to 
a 4-fold increase in the level of free threonine in the 
seeds . 

In the highest expressing lines it was possible to 
25 detect a high level of a-aminoadipic acid. This 

compound is known to be an intermediate in the 
catabolism of lysine in cereal seeds/ but is normally 
detected only via radioactive tracer experiments due to 
its low level of accumulation. The build-up of high 

30 levels of this intermediate indicates that a large 

amount of lysine is being produced in the seeds of these 
transformed lines and is passing through the catabolic 
pathway. The build-up of a-aminoadipic acid was not 
observed in transformants expressing only £. coli DHDPS 

35 or only AKIII-M4 in seeds. These results show that it 
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is necessary to express both enzymes simultaneously to 
produce high levels of free lysine. 



TABLE 7 

BT570 Transf orraants : phaseolin 5 'region /cts/lxa£-M4 /phaseolin 3 r region 
phaseolin 5 ' region/ cts/ecoiiaB&/phaseolin 3' region 

FREE AMINO TOTAL AMINO WESTERN WESTERN fi. CQU 
ACIDS/SEED ACIDS /SEED £. COLT £. COLT DHDPS Progeny 



LINE 


K/L 


T/L 


K/L 


T/L DHDPS 


AKIII U/MG/HR 


Kan r :Kan s 


NORMAL 


0.49 


1.3 


0.35 


0.68 - 


— 




570-4B 


0.31 


2.6 


0.34 


0 . 64 +++ 


++ 


15:1 


570-7C 


0.39 


2.3 


0.34 


0.64 ++ 


+ 




570-8B 


0.29 


2.1 


0.34 


0.63 + 


- 




570-12C* 


0.64 


5.1 


0.36 


0.68 ++-H- 


++++ > 4.3 


>15:1 


570-18A 


0.33 


3.0 


0.35 


0 . 65 ++ 


++ 


15:1 


570-24A 


0.33 


2.0 


0.34 


0 . 65 ++ 


— 




570-37A 


0.33 


2.1 


0.34 


0.64 +/- 


+/- 




570-44A 


0.29 


2.1 


0.34 


0.64 ++ 


+ 




570-46B 


0.41 


2.1 


0.35 


0.65 ++ 


+ 




570-51B 


0.33 


1.5 


0.33 


0.64 - 


0 




570-59B* 


0.46 


3.0 


0.35 


0 . 65 +++ 


+++ 2.6 


>15:1 


570-80A 


0.31 


2.2 


0.34 


0.64 ++ 


+ 




570-11A 


0.28 


2.3 


0.34 


0.67 ++ 


++ 


3:1 


570-17B 


0.27 


1.6 


0.34 


0.65 - 






570-20A 


0.41 


2.3 


0.35 


0 . 67 ++ 


+ 




570-21B 


0.26 


2.4 


0.34 


0.68 ++ 


+ 




570-23B* 


0.40 


3.6 


0.34 


0.68 +++ 


+++ 3.1 


63:1 


570-25D 


0.30 


2.3 


0.35 


0.66 ++ 


+/- 




570-26A 


0.28 


1.5 


0.34 


0.64 - 






570-32A 


0.25 


2.5 


0.34 


0.67 ++ 


+ 




570-35A 


0.25 


2.5 


0.34 


0.63 ++ 


++ 


3:1 


570-38A-1 


0.25 


2.6 


0.34 


0.64 ++ 


++ 


3:1 


570-38A-3 


0.33 


1.6 


0.35 


0.63 - 






570-42A 


0.27 


2.5 


0.34 


0.62 ++ 


++ 


3:1 


570-45A 


0.60 


3.4 


0.39 


0.64 ++ 


++ 


3:1 


* indicates free 


amino 


acid ^ 


sample has a-aminoadipic acid 
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Use of the cts/lvsC-M4 Chimeric Gene as a Selectable 

Marker for labasgQ Transforation 

The 35S promoter /Cab leader /cts/lys£-M4/Nos 3' 
chimeric gene in the binary vector pZS97K (pBT542, see 
Example 7) was used as a selectable genetic marker for 
transformation of tobacco. High concentrations of 
lysine plus threonine inhibit growth of shoots from 
tobacco leaf disks. Expression of active lysine and 
threonine insensitive AKIII-M4 reverses this growth 
inhibition (see Example 7) . 

The binary vector pBT542 was transferred by tri- 
parental mating to Acrrobacterium strain LBA4404/pAL4404, 
the Acrrobacteriiim transf ormants used to inoculate 
tobacco leaf disks and the resulting transformed shoots 
were selected on shooting medium containing 3mM lysine 
plus 3mM threonine. Shoots were transferred to rooting 
media containing 3mM lysine plus 3mM threonine. Plants 
were grown from the rooted shoots. Leaf disks from the 
plants were placed on shooting medium containing 3mM 
lysine plus 3mM threonine. Transformed plants were 
identified by the shoot proliferation which occurred 
around the leaf disks on this medium. 

EXAMPLE 14 
Transformation of Tob acco with the 
35S Promoter /cts/cordapA Chimeric r^n a 

The 35S promoter/Cab leader/cts/cordap&/Nos 3' 
chimeric gene was isolated as a 3.0 kb BamH I-Sal I 
fragment and inserted into BamH I-Sal I digested binary 
vector p2S97K (Figure 2), yielding plasmid pFS852. The 
binary vector is described in Example 7. 

The binary vector containing the chimeric cordapA 
gene was transferred by tri-parental mating to 
Agrobacterium strain LBA4404/pAL4404, the Acrrobacterium 
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transformant was used to inoculate tobacco leaf disks 
and the resulting transgenic plants were regenerated by 
the methods set out in Example 7 . 

To assay for expression of the chimeric gene in * 
5 leaves of the transformed plants, protein was extracted 

as described in Example 7, with the following ? 
modifications. The supernatant from the first ammonium 
sulfate precipitation, approximately 18mL, was mixed 
with an additional 12mL of cold, saturated ammonium 

10 sulfate. The mixture was set on ice for 30 min and 

centrifuged as described in Example 7. The supernatant 
was decanted and the pellet, which contained the DHDPS 
protein, was resuspended in 1 mL of TNE and desalted by 
passage over a Sephadex G-25 M column (Column PD-10, 

15 Pharmacia) . 

The leaf extracts were assayed for DHDPS protein 
and enzyme activity as described in Example 4. 
Corynebacteria DHDPS enzyme activity could be 
distinguished from tobacco DHDPS activity by its 

20 insensitivity to lysine inhibition. Eight of eleven 
transformants showed Corynebacteria DHDPS expression, 
both as protein detected via western blot and as active 
enzyme . 

Free amino acids were extracted from leaves as 
25 described in Example 7. Expression of CPKynebacteria 
DHDPS resulted in large increases in the level of free 
lysine in the leaves (Table 8) . However, there was not 
a good correlation between the level of expression of 
DHDPS and the amount of free lysine accumulated. Free 
30 lysine levels from 2 to 50-fold higher than ( 
untrans formed tobacco were observed. There was also a 
2 to 2.5-fold increase in the level of total leaf lysine f 
in the lines that showed high levels of free lysine . 
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TABLE R 

FS586 transf ormant3 : 35S promoter/Cab leader/cts/cordajaA/Nos 3' 



T TKTT7 

LINE 


FREE AMINO 
ACIDS /LEAF 
K/L 


TOTAL AMINO 
ACIDS /LEAF 

V /T 

K/L 


WESTERN 
CORYNE .. 
DHDPS 


CORYNE 

DHDPS 

U/MG/HI 


NORMAL 


0.5 


0.8 






FS586-2A 


1.0 


0.8 






FS586-4A 


0.9 


0.8 


+ 


6.1 


FS586-11B 


3.6 


0.8 


+ 


3.4 


FS586-11D 


26 


2.0 


+ 


3.5 


FS586-13A 


2.4 


0.8 


+ 


3.5 


FS586-19C 


5.1 


0.8 


+ 


3.1 


FS586-22B 


>15 


1.5 




2.3 


FS586-30B 




0.8 






FS586-38B 


18 


1.5 


++ 


3.9 


FS586-51A 


1.3 


0.8 






FS586-58C 


1.2 


0.8 


+ 


5.1 



The plants were allowed to flower, self-pollinate 
and go to seed. Mature seed was harvested and assayed 
for free amino acid composition as described in 
5 Example 8.. There was no difference in the free lysine 
content of the transformants compared to untransformed 
tobacco seed. 

EXAMPLE 15 
Transformation of Tobacco with the 
10 KTI3 promoter /cts/cordapA or 

Phaseolin Promoter/ cts/co rdapA plus 
Phaseolin Promoter/cts/lysC-M4 Ch imeric Genes 
The chimeric gene cassettes, KTI3 5* region/cts/ 
cordapA/KTI3 3' region and phaseolin 5' region/cts/ 
15 JLya£-M4 /phaseolin 3' as well as phaseolin 5' region/cts/ 
cordapA/phaseolin 3' region and phaseolin 5' region/cts/ 
l^s£-M4 /phaseolin 3' (Example 6) were combined in the 
binary vector pZS97 (Figure 3) . The binary vector is 
described in Example 8 . 
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To accomplish this the KTI3 5» region/cts/cor.d2B&/ 
KTI3 3' region chimeric gene cassette was isolated as a 
3.3 kb BamH I fragment and inserted into BamH I digested 
pBT549 (Example 8) , yielding pFS883. This binary vector 
5 has the chimeric genes, KTI3 5 f region/cts/cordap&/KTI3 
3 f region and phaseolin 5 l region/cts/l^s£l-M4/phaseolin 
3' region inserted in opposite orientations. 

The phaseolin 5» region/cts/cordap£/phaseolin 
3 r region chimeric gene cassette was modified using 

10 oligonucleotide adaptors to convert the Hind III sites 
at each end to BamH I sites. The gene cassette was then 
isolated as a 2.7 kb BamH I fragment and inserted into 
BamH I digested pBT549 (Example 8), yielding pFS903. 
This binary vector has both chimeric genes , phaseolin 5 f 

15 region/cts /corda pA /phaseolin 3' region and phaseolin 5' 
region/cts/l£££-M4/phaseolin 3 1 region inserted in the 
same orientation. 

The binary vectors pFS883 and pFS903 were 
transferred by tri-parental mating to Agrobacterium 

20 strain LBA4404/pAL4404, the Agrobacterium transformants 
were used to inoculate tobacco leaf disks and the 
resulting transgenic plants were regenerated by the 
methods set out in Example 7 . 

To assay for expression of the chimeric genes in 

25 the seeds of the transformed plants, the plants were 

allowed to flower, self-pollinate and go to seed. Total 
proteins were extracted from mature seeds and analyzed 
via western blots as described in Example 8. 

Twenty-one of twenty-two transformants tested 

30 expressed the DHDPS protein and eighteen of these also 
expressed the AKIII protein (Table 8) . Enzymatically 
active Corynebacteria DHDPS was observed in mature seeds 
of all the lines tested wherein the protein was detected 
except one. 
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To measure free amino acid composition of the 
seeds, free amino acids were extracted from mature seeds 
and analyzed as described in Example 8 . There was a 
good correlation between transformants expressing higher 
5 levels of both DHDPS and AKIII protein and those having 
higher levels of free lysine and threonine. The highest 
expressing lines showed up to a 3-fold increase in free 
lysine levels and up to a 8-fold increase in the level 
of free threonine in the seeds . As was described in 
10 Example 12 , a high level of a-aminoadipic acid, 

indicative of lysine catabolism, was observed in many of 
the transformed lines (indicated by asterisk in 
Table 9) . There was no major difference in the free 
amino acid composition or level of protein expression 
15 between the transformants which had the KTI3 or 

Phaseolin regulatory sequences driving expression of the 
Corynebacteria dhdps gene. 

TABLE 9 

FS883 Transformants: phaseolin 5' region/cts/lys£-M4 /phaseolin 3' 

KTI3 5 r region/cts/cordae&/KTI3 3' 

FS903 Transformants: phaseolin 5' region/cts/lys£-M4 /phaseolin 3 f 

phaseolin 5 1 region/ cts/cordasA/phaseolin 3 1 





FREE 


AMINO 


WESTERN 


WESTERN 


CORYNE ■ 






ACIDS/ SEED 


CQRYNE . 


£. COLT 


DHDPS 


Progeny 


LINE 


K/L 


T/L 


DHDPS 


AKIII 


D/MG/HR 


Kan r :Kan s 


NORMAL 


0.5 


1.3 










FS883-4A 


0.9 


4.0 


+ 


+ 




>15:1 


FS883-11A 


1.0 


3.5 


++ 


++ 


3.1 


3:1 


FS883-14B 


0.5 


2.5 


++ 


++ 






FS883-16A* 


0.7 


10.5 


+ 


+++ 


0 




FS883-17A* 


1.0 


5.0 


+++ 


+++ 


7.0 




FS883-18C* 


1.2 


3.5 


++ 


+ 


5.8 


3:1 


FS883-21A 


0.5 


1.5 


+ 


+/- 






FS883-26B* 


1.1 


3.6 


++ 


++ 


2.4 




FS883-29B 


0.5 


1.5 


+ 




0.4 




FS883-32B 


0.7 


2.4 


++ 


+ 


1.5 


3:1 
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FREE AMINO WESTERN WESTERN CQRYNE ■ 
ACIDS /SEED CQRYNE. fi. COLT DHDPS Progeny 
LINE K/L T/L DHDPS AKIII U/MG/HR Kan r :Kan s 

FS883-38B* 1.1 11.3 + ++ 2.0 

FS883-59C* 1.4 6.1 + + 0.5 15:1 

FS903-3C 0.5 1.8 + +++ 

FS903-8A* 0.8 2.1 +++ ++++ 

FS903-9B 0.6 1.8 ++ ++ 4.3 

FS903-10A 0.5 1-5 

FS903-22F 0.5 1.8 ++ ++ 0.9 

FS903-35B* 0.8 2.1 ++ ++ 
FS903-36B 0.7 1.5 + 

FS903-40A 0.6 1.8 + + 

FS903-41A* 1.2 2.0 ++ +++ 
FS903-42A 0.7 2.2 ++ +++ 5.4 

FS903-44C 0.5 1.9 

FS903-53B 0.6 1.9 

* indicates free amino acid sample has a-aminoadipic acid 

Free amino acid composition and expression of 
bacterial DHDPS and AKIII proteins was also analyzed in 
developing seeds of two lines that segregated as single 
5 gene cassette insertions (see Table 10) . Expression of 
the DHDPS protein under control of the KTI3 promoter was 
detected at earlier times than that of the AKIII protein 
under control of the Phaseolin promoter, as expected. 
At 14 days after flowering both proteins were expressed 

10 at a high level and there was about an 8-fold increase 
in the level of free lysine compared to normal seeds. 
These results confirm that simultaneous expression of 
lysine insensitive DHDPS and lysine-insensitive AK 
results in the production of high levels of free lysine 

15 in seeds. Free lysine does not continue to accumulate 
to even higher levels, however. In mature seeds free 
lysine is at a level 2 to 3-fold higher than in normal 
mature seeds, and the lysine breakdown product 
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a-aminoadipic acid accumulates. These results provide 
further evidence that lysine catabolism occurs in seeds 
and prevents accumulation of the high levels of free 
lysine produced in transformants expressing lysine 
5 insensitive DHDPS and lysine insensitive AK. 

TABLE 10 

Developing seeds of FS883 Transformants: 

phaseolin 5* region/cts/Jj£2£-M4 /phaseolin 3 1 region 

KTI3 5' region/ cts/cordapA/KTl3 3' region 



LINE 


DAYS AFTER 
FLOWERING 


FREE AMINO 
ACIDS /SEED 
K/L T/L 


WESTERN 

CORYNE. 

DHDPS 


WESTERN 
£. COLT 
AKIII 


FS883-18C 


9 


1.1 


2.1 






FS883-18C 


10 


1.4 


3.3 


+/- 




FS883-18C 


11 


1.4 


2.5 


+ 




FS883-18C 


14 


4.3 


1.0 


++ 


++ 


FS883-18C* 


MATURE 


1.2 


3.5 


+++ 


++ 


FS883-32B 


9 


1.3 


2.9 


+ 




FS883-32B 


10 


1.6 


2.7 


+ 




FS883-32B 


11 


1.4 


2.3 


+ 




FS883-32B* 


14 


3.9 


1.3 


++ 


++ 


FS883-32B* 


MATURE 


0.7 


2.4 


+++ 


++ 



* indicates free amino acid sample has a-aminoadipic acid 



EXAMPLE Ifi 

10 Transformation of Cannla with thg 

Phaseolin Promotftr/ctfi/oorriapA anri 
Phaseolin Promoter/cts/lYsC-M4 Chimeric Sfiliaa 
The chimeric gene cassettes, phaseolin 5' region/ 
cts/cordapA/phaseolin 3' region, phaseolin 5' region/ 
15 cts/jysC-M4/phaseolin 3 1 , and phaseolin 5» region/ 
cts/cordapA/phaseolin 3' region plus phaseolin 5* 
region/cts/l^s£-M4 /phaseolin 3' (Example 6) were 
inserted into the binary vector pZS199 (Figure 4), which 
is similar to pSZ97K described in Example 8. In pZS199 
20 the 35S promoter from Cauliflower Mosaic Virus replaced 
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the nos promoter driving expression of the NPT II to 
provide better expression of the marker gene, and the 
orientation of the polylinker containing the multiple 
restriction endonuclease sites was reversed. 
5 To insert the phaseolin 5' region/ cts/corda pA/ 

phaseolin 3" region 3", the gene cassette was isolated as 
a 2.7 kb BamH I fragment (as described in Example 15) 
and inserted into BamH I digested pZS199, yielding 
plasmid pFS926. This binary vector has the chimeric 

10 gene, phaseolin 5 r region/cts /cordapA/phaseolin 3' 

region inserted in the same orientation as the 35S/NPT 
II/nos 3 1 marker gene . 

To insert the phaseolin 5* region/cts/ lysC- 
M4/phaseolin 3 r region, the gene cassette was isolated 

15 as a 3.3 kb EcoR I to Spe I fragment and inserted into 
EcoR I plus Xba I digested pZS199, yielding plasmid 
pBT593. This binary vector has the chimeric gene, 
phaseolin 5 1 region/cts /lxs£-M4 /phaseolin 3 f region 
inserted in the same orientation as the 35S/NPT II/nos 

20 3 1 marker gene. 

To combine the two cassettes, the EcoR I site of 
pBT593 was converted to a BamH I site using 
oligonucleotide adaptors, the resulting vector was cut 
with BamH I and the phaseolin 5' region/cts /corda pA/ 

25 phaseolin 3 1 region gene cassette was isolated *as a 
2.7 kb BamH I fragment and inserted, yielding pBT597. 
This binary vector has both chimeric genes, phaseolin 5 1 
region/cts /cordafiA/phaseolin 3' region and phaseolin 5 1 
region/cts /lysC- M4 /phaseolin 3 f region inserted in the 

30 same orientation as the 35S/NPT II/nos 3 f marker gene. 

Brassica na£ii£ cultivar "Westar" was transformed by 
co-cultivation of seedling pieces with disarmed 
ftgrobacterium tumefaciens strain LBA4404 carrying the 
the appropriate binary vector. 
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£. naEllS. seeds were sterilized by stirring in 10% 
Chlorox, 0.1% SDS for thirty min, and then rinsed 
thoroughly with sterile distilled water. The seeds were 
germinated on sterile medium containing 30 mM CaC12 and 
5 1.5% agar, and grown for six d in the dark at 24°C. 
Liquid cultures of Acrrobacterium for plant 
transformation were grown overnight at 28°C in Minimal A 
medium containing 100 mg/L kanamycin. The bacterial 
cells were pelleted by centrifugation and resuspended at 

10 a concentration of 10 8 cells/mL in liquid Murashige and 
Skoog Minimal Organic medium containing 100 uM 
acetosyringone . 

fi. napns. seedling hypocotyls were cut into 5 mm 
segments which were immediately placed into the 

15 bacterial suspension. After 30 min, the hypocotyl pieces 
were removed from the bacterial suspension and placed 
onto BC-35 callus medium containing 100 uM 
acetosyringone. The plant tissue and Acrrobacteria were 
co-cultivated for three d at 24°C in dim light. 

20 The co-cultivation was terminated by transferring 

the hypocotyl pieces to BC-35 callus medium containing 
200 mg/L carbenicillin to kill the Aarobacteria , and 
25 mg/L kanamycin to select for transformed plant cell 
growth. The seedling pieces were incubated on this 

25 medium for three weeks at 24°C under continuous light. 

After three weeks, the segments were transferred to 
BS-48 regeneration medium containing 200 mg/L 
carbenicillin and 25 rag/L kanamycin. Plant tissue was 
subcultured every two weeks onto fresh selective 

30 regeneration medium, under the same culture conditions 
described for the callus medium. Putatively transformed 
calli grew rapidly on regeneration medium; as calli 
reached a diameter of about 2mm, they were removed from 
the hypocotyl pieces and placed on the same medium 

35 lacking kanamycin 
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Shoots began to appear within several weeks after 
transfer to BS-48 regeneration medium. As soon as the 
shoots formed discernable stems, they were excised from 
the calli, transferred to MSV-1A elongation medium, and 
5 moved to a 16:8-h photoperiod at 24°C. 

Once shoots had elongated several internodes, they 
were cut above the agar surface and the cut ends were 
dipped in Rootone. Treated shoots were planted directly 
into wet Metro-Mix 350 soiless potting medium. The pots 
10 were covered with plastic bags which were removed when 
the plants were clearly growing, after about ten d. 
Results of the transformation are shown in Table 11. 
Transformed plants were obtained with each of the binary 
vectors, 

15 Plants were grown under a 16:8-h photoperiod, with 

a daytime temperature of 23°C and a nightime temperature 
of 17°C. When the primary flowering stem began to 
elongate, it was covered with a mesh pollen-containment 
bag to prevent outcrossing. Self-pollination was 

20 facilitated by shaking the plants several times each 
day. Mature seeds derived from self-pollinations were 
harvested about three months after planting. 



Minimal ft Bacterial Growth Medium 
25 Dissolve in distilled water: 

10*5 g potassium phosphate, dibasic 
4.5 g potassium phosphate, monobasic 
1,0 g ammonium sulfate 
0.5 g sodium citrate, dihydrate 
30 Make up to 979 mL with distilled water 
Autoclave 

Add 20 mL filter-sterilized 10% sucrose 
Add 1 mlr filter-sterilized 1 M MgS04 
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Brassica Callus Medium BC-35 
Per liter: 

Murashige and Skoog Minimal Organic Medium 
(MS salts, 100 mg/L i-inositol, 0.4 mg/L thiamine; GIBCO 
5 #510-3118) 

30 g sucrose 
18 g mannitol 
0.5 mg/L 2,4-D 
0.3 mg/L kinetin 
10 0.6% agarose 

pH 5.8. 

Brassica Regeneration Medium ftft-4R 

Murashige and SJcoog Minimal Organic Medium 
Gamborg B5 Vitamins (SIGMA #1019) 
10 g glucose 
250 mg xylose 
600 mg MES 
0.4% agarose 
pH 5.7 

Filter-sterilize and add after autoclaving: 
2.0 mg/L zeatin 
0.1 mg/L IAA 

25 Brassica Shoot Elongation Medium MSV-1A 

Murashige and Skoog Minimal Organic Medium 

Gamborg B5 Vitamins 

10 g sucrose 

0.6% agarose 
30 pH 5.8 



15 



20 
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TftBIiE 11 
Canola transformants 



NUMBER OF 



BINARY 
VECTOR 


NUMBER OF 
CUT ENDS 


NUMBER OF 
KAN R CALLI 


SHOOTING 
CALLI 


NUMBER OF 
PLANTS 


PZS199 


120 


41 


5 


2 


pFS92 6 


600 


278 


52 


28 


pBT593 


eoo 


70 


10 


3 


pBT597 


600 


223 


40 


23 



EXAMPLE 17 

Transformation of Maize Using a Chimeric lysC-M4 Gene 

as a Selectable Marker 
5 Embryogenic callus cultures were initiated from 

immature embryos (about 1.0 to 1.5 mm) dissected from 
kernels of a corn line bred for giving a "type II 
callus" tissue culture response. The embryos were 
dissected 10 to 12 d after pollination and were placed 

10 with the axis-side down and in contact with agarose- 
solidified N6 medium [Chu et al. (1974) Sci Sin 
18:659-668] supplemented with 0.5 mg/L 2,4-D (N6-0.5) . 
The embryos were kept in the dark at 27°C. Friable 
embryogenic callus consisting of undifferentiated masses 

15 of cells with somatic proembryos and somatic embryos 
borne on suspensor structures proliferated from the 
scutellum of the immature embryos. Clonal embryogenic 
calli isolated from individual embryos were identified 
and sub-cultured on N6-0.5 medium every 2 to 3 weeks. 

20 The particle bombardment method was used to 

transfer genes to the callus culture cells. A 
Biolistic, PDS-1000/He (BioRAD Laboratories, Hercules, 
CA) was used for these experiments. 

The plasmid pBT573, containing the chimeric gene 

25 HH534 5' region/ mcts/l£s£-M4/HH2-l 3 1 region (see 

Example 6) designed for constitutive gene expression in 
corn, was precipitated onto the surface of gold 
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particles. To accomplish this 2.5 p.g of pBT573 (in 
water at a concentration of about 1 mg/mL) was added to 
25 mL of gold particles (average diameter of 1.5 Jim) 
suspended in water (60 mg of gold per mL) . Calcium 
5 chloride (25 mL of a 2.5 M solution) and spermidine (10 
mL of a 1.0 M solution) were then added to the gold-DNA 
suspension as the tube was vortexing. The gold 
particles were centrifuged in a microfuge for 10 sec and 
the supernatant removed. The gold particles were then 

10 resuspended in 200 mL of absolute ethanol, were 

centrifuged again and the supernatant removed. Finally, 
the gold particles were resuspended in 25 mL of absolute 
ethanol and sonicated twice for one sec. Five ^L of the 
DNA-coated gold particles were then loaded on each macro 

15 carrier disk and the ethanol was allowed to evaporate 
away leaving the DNA-covered gold particles dried onto 
the disk. 

Embryogenic callus (from the callus line designated 
#132.2.2) was arranged in a circular area of about 6 cm 

20 in diameter in the center of a 100 X 20 mm petri dish 
containing N6-0.5 medium supplemented with 0.25M 
sorbitol and 0.25H mannitol. The tissue was placed on 
this medium for 2 h prior to bombardment as a 
pretreatment and remained on the medium during the 

25 bombardment procedure. At the end of the 2 h 

pretreatment period, the petri dish containing the 
tissue was placed in the chamber of the PDS-1000/He. 
The air in the chamber was then evacuated to a vacuum of 
28 inch of Hg. The macrocarrier was accelerated with a 

30 heliu- shock wave using a rupture membrane that bursts 
when the He pressure in the shock tube reaches 1100 psi. 
The tissue was placed approximately 8 cm from the 
stopping screen. Four plates of tissue were bombarded 
with the DNA-coated gold particles. Immediately 

35 following bombardment, the callus tissue was transferred 
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to N6-0.5 medium without supplemental sorbitol or 
maimitol. 

Seven d after bombardment small (2-4 mm diameter) 
clumps of callus tissue were transferred to N6-0.5 
5 medium lacking casein or proline, but supplemented with 
2mM each of lysine and threonine (LT) . The tissue 
continued to grow slowly on this medium and was 
transferred to fresh N6-0.5 medium supplemented with LT 
every 2 weeks. After 12 weeks two clones of actively 

10 growing callus was identified on two separate plates 
containing LT-supplemented medium. These clones 
continued to grow when sub-cultured on the selective 
medium. The presence of the JLys£-M4 gene in the 
selected clones was confirmed by PCR analysis. Callus 

15 was transferred to medium that promotes plant 
regeneration . 

EXAMPLE 18 
Transformation of Com with, .the 

Constitutive Corn Promoter/gts/eco dapA and 

20 Constitutive Corn Prorcpt,ftr/cts/lysC-M4 

The chimeric gene cassettes, HH534 5' region/ 
mcts/ecodapA/HH2-l 3' region plus HH534 5 1 region/ 
mcts/l£s£-M4/HH2-l 3 f region, (Example ]5) were inserted 
into the vector pGem9z to generate a corn transformation 

25 vector. Plasmid pBT583 (Example 6) was digested with 
Sal I and an 1850 bp fragment containing the HH534 5' 
region/mcts/ecoda£&/HH2-l 3' region gene cassette was 
isolated. This DNA fragment was inserted into pBT573 
(Example 6) , which carries the HH534 5 1 region/mcts/ 

30 Ixs£-M4/HH2-1 3' region, digested with Xho I. The 

resulting vector with both chimeric genes in the same 
orientation was designated pBT586. 

Vector pBT586 was introduced into embryogenic corn 
callus tissue using the particle bombardment method. 

35 The establishment of the embryogenic callus cultures and 
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the parameters for particle bombardment were as 
described in Example 13. 

Either one of two plasmid vectors containing 
selectable markers were used in the transformations. 
5 One plasmid, pALSLUC [Fromm et al. (1990) Biotechnology 
8:833-839] , contained a cDNA of the maize acetolactate 
synthase (ALS) gene. The ALS cDNA had been mutated in 
Vitro so that the enzyme coded by the gene would be 
resistant to chlorsulfuron . This plasmid also contains a 

10 gene that uses the 35S promoter from Cauliflower Mosaic 
Virus and the 3 f region of the nopaline synthase gene to 
express a firefly luciferase coding region [de Wet et 
al. (1987) Molec. Cell Biol. 7:725-737]. The other 
plasmid, pDETRIC, contained the bar gene from 

15 Streptomyrfifi hygroscopic that confers resistance to 
the herbicide glufosinate [Thompson et al. (1987 The 
EMBO Journal 6:2519-2523]. The bacterial gene had its 
translation codon changed from GTG to ATG for proper 
translation initiation in plants [De Block et al. (1987) 

20 The EMBO Journal 6:2513-2518]. The bar gene was driven 
by the 35S promoter from Cauliflower Mosaic Virus and 
uses the termination and polyadenylation signal from the 
octopine synthase gene from AarobartPriiim tumef ari^.Q . 

For bombardment, 2.5 jig of each plasmid, pBT586 and 

25 one of the two selectable marker plasmids, was co- 
precipitated onto the surface of gold particles as 
described in Example 13. Bombardment of the embryogenic 
tissue cultures was also as described in Example 13. 
Seven d after bombardment the tissue was 

30 transferred to selective medium. The tissue bombarded 
with the selectable marker pALSLUC was transferred to 
N6-0.5 medium that contained chlorsulfuron (30 ng/L) and 
lacked casein or proline. The tissue bombarded with the 
selectable marker, pDETRIC, was transferred to N6-0.5 

35 medium that contained 2 mg/L glufosinate and lacked 
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casein or proline. The tissue continued to grow slowly 
on these selective media. After an additional 2 weeks 
the tissue was transferred to fresh N6-0.5 medium 
containing the selective agents. 
5 Chlorsulfuron- and glufosinate-resistance callus 

clones could be identified after an additional 6-8 
weeks. These clones continued to grow when transferred 
to the selective media. 

The presence of pBT586 in the transformed clones 

10 has been confirmed by PCR analysis. Functionality of 
the introduced AK enzyme was tested by plating out 
transformed clones on N6-0.5 media containing 2 raM each 
of lysine and threonine (LT selection; see example 13) . 
All of the clones were capable of growing on LT medium 

15 indicating that the EL. coli aspartate kinase was 

expressed and was functioning properly. To test that 
the £. coli DHDPS enzyme was functional/ transformed 
callus was plated on N6-0.5 media containing 2jlm 
2-aminoethylcysteine (AEC) , a lysine analog and potent 

20 inhibitor of plant DHDPS. The transformed callus tissue 
was resistant to AEC indicating that the introduced 
DHDPS, which is about 16-fold less sensitive to AEC than 
the plant enzyme, was being produced and was functional. 
Plants have been regenerated from several transformed 

25 clones and are being grown to maturity* 

EXAMPLE 19 
Transformation of Soybean with the 
Phaseolin Promoter/cts/eorriapA and 
Phaseolin Promoter /cfca/lyaC-MA Chi meric Genes 
30 The chimeric gene cassettes, phaseolin 5 1 region/ 

cts/cordajifc/phaseolin 3' region plus phaseolin 5 f 
region/cts/ly_s£-M4 /phaseolin 3 l , (Example 6) were 
inserted into the soybean transformation vector pBT603 
(Figure 5) . This vector has a soybean transformation 
35 marker gene consisting of the 35S promoter from 
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Cauliflower Mosaic Virus driving expression of the 
£. CQli p-glucuronidase gene [Jefferson et al. (1986) 
Proc. Natl. Acad. Sci. USA 83:8447-8451] with the Nos 3 f 
region in a modified pGEM9Z plasmid. 
5 To insert the phaseolin 5 1 reqion/cts/ lysC- M4/ 

phaseolin 3 ! region, the gene cassette was isolated as a 
3.3 kb Hind III fragment and inserted into Hind III 
digested pBT603, yielding plasmid pBT609. This binary 
vector has the chimeric gene f phaseolin 5' region/ 

10 cts/lys£-M4 /phaseolin 3' region inserted in the opposite 
orientation from the 35S/GUS/Nos 3' marker gene. 

To insert the phaseolin 5 f region/cts/cordaeA/ 
phaseolin 3' region 3 f , the gene cassette was isolated as 
a 2.7 kb BamH I fragment (as described in Example 15) 

15 and inserted into BamH I digested pBT609, yielding 
plasmid pBT614. This vector has both chimeric genes, 
phaseolin 5 f region/cts/cordapA/phaseolin 3 f region 
inserted in the same orientation, and both are in the 
opposite orientation from the 35S/GUS/Nos 3 ■■ marker 

20 gene. 

Soybean was transformed with plasmid pBT6l4 
according to the procedure described in United States 
Patent No. 5,015,580. Soybean transformation was 
performed by Agracetus Company (Middleton, WI) . 

25 EXAMPLE 70 . 

Isolation nf g P1* nt 

Lysine Retool utarat.e Reducta^ 
Lysine Ketoglutarate Reductase (LKR) enzyme 
activity has been observed in immature endosperm of 
30 developing maize seeds [Arruda et al. (1982) Plant 

Physiol. 69:988-989]. LKR activity increases sharply 
from the onset of endosperm development, reaches a peak 
level at about 20 d after pollination, and then declines 
[Arruda et al. (1983) Phytochemistry 22:2687-2689]. 
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In order to clone the corn LKR gene r RNA was 
isolated from developing seeds 19 d after pollination. 
This RNA was sent to Clontech Laboratories, Inc., (Palo 
Alto, CA) for the custom synthesis of a cDNA library in 
5 the vector Lambda Zap II. The conversion of the Lambda 
Zap II library into a phagemid library, then into a 
plasmid library was accomplished following the protocol 
provided by Clontech. Once converted into a plasmid 
library the ampicillin-resistant clones obtained carry 

10 the cDNA insert in the vector pBluescript SK(-) . 

Expression of the cDNA is under control of the lacZ 
promoter on the vector. 

Two phagemid libraries were generated using the 
mixtures of the Lambda Zap II phage and the filamentous 

15 helper phage of 100 |!L to 1 J1L. Two additional 

libraries were generated using mixtures of 100 JIL Lambda 
Zap II to 10 \LL helper phage and 20 ^L Lambda Zap II to 
10 jiL helper phage. The titers of the phagemid 
preparations were similar regardless of the mixture used 
20 and were about 2 x 10 3 ampicillin-resistant- 

trans feet ants per mL with £. soli strain XLl-Blue as 
the host and about 1 x 10 3 with DE126 (see below) as 
host. 

To select clones that carried the LKR gene a 
25 specially designed £. coli host, DE126 was constructed. 
Construction of DE126 occurred in several stages. 

(1) A generalized transducing stock of coliphage Plvir 
was produced by infection of a culture of TST1 [F"/ 
axaB.139, delta (axg£-lfi£) 205, £lfc5301, j2£aF25, relA l. 

30 X&SL150, malE52: :TnlQ, deoCl- Ar] (£L* coli Genetic Stock 
Center #6137) using a standard method (for Methods see 
J. Miller, Experiments in Molecular Genetics) . 

(2) This phage stock was used as a donor in a 
transductional cross (for Method see J. Miller, 

35 Experiments in Molecular Genetics) with strain GIF106M1 
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[F~, arg-, ilvA 296 r lysC 10Ql r thrA HOl, metL 10QQ r AT' 
XR&L9, maill, mtl-2, thil(?) , £UBE44<?)] (£. coli 

Genetic Stock Center #5074) as the recipient. 
Recombinants were selected on rich medium [L 
5 supplemented with DAP] containing the antibiotic 
tetracycline. The transposon TnlO f conferring 
tetracycline resistance, is inserted in the malE gene of 
strain TST1. Tetracycline-resistant transductants 
derived from this cross are likely to contain up to 2 

10 min of the £. chromosome in the vicinity of ma IE . 

The genes mal£ and lysC are separated by less than 0.5 
minutes, well within cot ran s duct ion distance. 

(3) 200 tetracycline-resistant transductants were 
thoroughly phenotyped; appropriate fermentation and 

15 nutritional traits were scored. The recipient strain 
GIF106M1 is completely devoid of aspartokinase isozymes 
due to mutations in thrA r msJLL and lysC r and therefore 
requires the presence of threonine, methionine, lysine 
and meso-diaminopimelic acid (DAP) for growth. 

20 Transductants that had inherited lysC + with malE ! :Tnl0 
from TST1 would be expected to grow on a minimal medium 
that contains vitamin Bl, L-arginine, L-isoleucine and 
L-valine in addition to glucose which serves as a carbon 
and energy source. Moreover strains having the genetic 

25 constitution of lysC + , meli- and thrA- will only express 
the lysine sensitive aspartokinase. Hence addition of 
lysine to the minimal medium should prevent the growth 
of the l¥S£ + recombinant by leading to starvation for 
threonine, methionine and DAP. Of the 200 tetracycline 
30 resistant transductants examined, 49 grew on the minimal 
medium devoid of threonine, methionine and DAP. 
Moreover, all 49 were inhibited by the addition of 
L-lysine to the minimal medium. One of these 
transductants was designated DE125. DE125 has the 

35 phenotype of tetracycline resistance, growth 
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requirements for arginine f isoleucine and valine, and 
sensitivity to lysine. The genotype of this strain is 
F~ mal£52: :TnlO arg- ilvA 296 £]n£1101 mstLlOOO lambda- 
rpsL 9 malXl xyl- 7 m£l-2 thil m supE 44 (?) . 
5 (4) This step involves production of a male 

derivative of strain DE125. Strain DE125 was mated with 
the male strain AB1528 [F , 16/delta(cao£--Ex^) 62, lacY l or 
la££4, glnV 44 P aalE2 rac~(?), his£4, rfbd l r mgl-51, 
JsdgKSK?), ±lz£Ll, acg£3, £hl-l] (£L. coli Genetic Stock 

10 Center #1528) by the method of conjugation. F'16 

carries the ilvGMEDAYC gene cluster. The two strains 
were cross streaked on rich medium permissive for the 
growth of each strain. After incubation, the plate was 
replica plated to a synthetic medium containing 

15 tetracycline, arginine, vitamin Bl and glucose. DE125 
cannot grow on this medium because it cannot synthesize 
isoleucine. Growth of AB1528 is prevented by the 
inclusion of the antibiotic tetracycline and the 
omission of proline and histidine from the synthetic 

20 medium. A patch of cells grew on this selective medium. 
These recombinant cells underwent single colony 
isolation on the same medium. The phenotype of one 
clone was determined to be Ilv + / Arg", TetR, Lysine- 
sensitive, male specific phage (MS2) -sensitive, 

25 consistent with the simple transfer of F'16 from AB1528 
to DE125. This clone was designated DE126 and has the 
genotype F f 16/mal£52: :TnlO, arg~, 11E&296, thrAllOl, 
mstLlOO, lysC*, AT, xesL9, main, xyi- 7. mti-2, thi-1?, 
SUPE44?. It is inhibited by 20 Jlg/mL of L-lysine in a 

30 synthetic medium. 

To select for clones from the corn cDNA library 
that carried the LKR gene, 100 \JLL of the phagemid 
library was mixed with 100 JiL of an overnight culture of 
DE126 grown in L broth and the cells were plated on 

35 synthetic media containing vitamin Bl, L-arginine, 
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glucose as a carbon and energy source, 100 |Xg/mL 
ampicillin and L-lysine at 20, 30 or 40 Jlg/mL, Four 
plates at each of the three different lysine 
concentrations were prepared. The amount of' phagemid 
5 and DE126 cells was expected to yield about 1 x 10 5 
ampicillin-resistant transfectants per plate. Ten to 
thirty lysine-resistant colonies grew per plate (about 1 
lysine-resistant per 5000 amplcillin-resistant 
colonies) . 

10 Plasmid DNA was isolated from 10 independent clones 

and retransformed into DE126. Seven of the ten DNAs 
yielded lysine-resistant clones demonstrating that the 
lysine-resistance trait was carried on the plasmid. The 
reported size of corn LKR is about 140,000 daltons, 

15 requiring an mRNA of at least 3.8 kb. Restriction 
enzyme analysis of the DNAs showed that clone 11D 
contained a plasmid with an approximately 4 kb insert of 
corn DNA, large enough to carry a full-length cDNA. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: E. I. DU PONT DE NEMOURS AND 

COMPANY 
TITLE OF INVENTION: 



(ii) 



(iii) 
(iv) 



(v) 



(vi) 



(vii) 



(viii) 



NUCLEIC ACID 
FRAGMENTS AND 
METHODS FOR 
INCREASING THE 
LYSINE AND THREONINE 
CONTENT OF THE SEEDS 
OF PLANTS 
NUMBER OF SEQUENCES: 24 
CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: E. I. DU PONT DE NEMOURS 

AND COMPANY 

(B) STREET: 1007 MARKET STREET 

(C) CITY: WILMINGTON 

(D) STATE : DELAWARE 

(E) COUNTRY: U.S.A. 

(F) ZIP: 19898 
COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.50 INCH 

(B) COMPUTER: MACINTOSH 

(C) OPERATING SYSTEM: MACINTOSH, 6.0 

(D) SOFTWARE: MICROSOFT WORD, 4.0 
CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: BB-1037-A 

(B) FILING DATE: 

(C) CLASSIFICATION: 
PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/855,414 

(B) FILING DATE: 19 MARCH 1992 
ATTORNEY/AGENT INFORMATION: 

(A) NAME : LINDA AXAMETHY FLOYD 

(B) REGISTRATION NUMBER: 33,692 
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(C) REFERENCE /DOCKET NUMBER: BB-1037-A 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE : 302-992-4 92 9 

(B) TELEFAX: 302-892-7949 

(C) TELEX: 835420 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1350 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



ATG GCT GAA ATT GTT GTC TCC AAA TTT GGC GGT ACC AGC GTA GCT GAT 48 
Met Ala Glu He Val Val Ser Lys Phe Gly Gly Thr Ser Val Ala Asp 
15 10 15 



TTT GAC GCC ATG AAC CGC AGC GCT GAT ATT GTG CTT TCT GAT GCC AAC 96 
Phe Asp Ala Met Asn Arg Ser Ala Asp He Val Leu Ser Asp Ala Asn 
20 25 30 



GTG CGT TTA GTT GTC CTC TCG GCT TCT GCT GGT ATC ACT AAT CTG CTG 144 
Val Arg Leu Val Val Leu Ser Ala Ser Ala Gly He Thr Asn Leu Leu 
35 40 45 

GTC GCT TTA GCT GAA GGA CTG GAA CCT GGC GAG CGA TTC GAA AAA CTC 192 
Val Ala Leu Ala Glu Gly Leu Glu Pro Gly Glu Arg Phe Glu Lys Leu 
50 55 60 
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GAC GCT ATC CGC AAC ATC CAG TTT GCC ATT CTG GAA CGT CTG CGT TAC 
Asp Ala lie Arg Asn lie Gin Phe Ala lie Leu Glu Arg Leu Arg Tyr 
65 70 75 80 



240 



CCG AAC GTT ATC CGT GAA GAG ATT GAA CGT CTG CTG GAG AAC ATT ACT 
Pro Asn Val lie Arg Glu Glu lie Glu Arg Leu Leu Glu Asn lie Thr 
85 90 95 



288 



GTT CTG GCA GAA GCG GCG GCG CTG GCA ACG TCT CCG GCG CTG ACA GAT 
Val Leu Ala Glu Ala Ala Ala Leu Ala Thr Ser Pro. Ala Leu Thr Asp 
100 105 110 



336 



GAG CTG GTC AGC CAC GGC GAG CTG ATG TCG ACC CTG CTG TTT GTT GAG 
Glu Leu Val Ser His Gly Glu Leu Met Ser Thr Leu Leu Phe Val Glu 
115 120 125 



384 



ATC CTG CGC GAA CGC GAT GTT CAG GCA CAG TGG TTT GAT GTA CGT AAA 
lie Leu Arg Glu Arg Asp Val Gin Ala Gin Trp Phe Asp Val Arg Lys 
130 135 140 



432 



GTG ATG CGT ACC AAC GAC CGA TTT GGT CGT GCA GAG CCA GAT ATA GCC 
Val Met Arg Thr Asn Asp Arg Phe Gly Arg Ala Glu Pro Asp lie Ala 
145 150 155 160 



480 



GCG CTG GCG GAA CTG GCC GCG CTG CAG CTG CTC CCA CGT CTC AAT.GAA 
Ala Leu Ala Glu Leu Ala Ala Leu Gin Leu Leu Pro Arg Leu Asn Glu 
165 170 175 



528 



GGC TTA GTG ATC ACC CAG GGA TTT ATC GGT AGC GAA AAT AAA GGT CGT 
Gly Leu Val He Thr Gin Gly Phe He Gly Ser Glu Asn Lys Gly Arg 
180 185 190 



576 



ACA ACG ACG CTT GGC CGT GGA GGC AGC GAT TAT ACG GCA GCC TTG CTG 
Thr Thr Thr Leu Gly Arg Gly Gly Ser Asp Tyr Thr Ala Ala Leu Leu 
195 200 205 



624 
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GCG GAG GCT TTA CAC GCA TCT CGT GTT GAT ATC TGG ACC GAC GTC CCG 
Ala Glu Ala Leu His Ala Ser Arg Val Asp He Trp Thr Asp Val Pro 
210 215 220 



672 



GGC ATC TAC ACC ACC GAT CCA CGC GTA GTT TCC GCA GCA AAA CGC ATT 
Gly He Tyr Thr Thr Asp Pro Arg Val Val Ser Ala Ala Lys Arg lie 
225 230 235 240 



720 



GAT GAA ATC GCG TTT GCC GAA GCG GCA GAG ATG GCA ACT TTT GGT GCA 
Asp Glu He Ala Phe Ala Glu Ala Ala Glu Met Ala Thr Phe Gly Ala 
245 250 255 



768 



AAA GTA CTG CAT CCG GCA ACG TTG CTA CCC GCA GTA CGC AGC GAT ATC 
Lys Val Leu His Pro Ala Thr Leu Leu Pro Ala Val Arg Ser Asp He 
260 265 270 



816 



CCG GTC TTT GTC GGC TCC AGC AAA GAC CCA CGC GCA GGT GGT ACG CTG 
Pro Val Phe Val Gly Ser Ser Lys Asp Pro Arg Ala Gly Gly Thr Leu 
275 280 285 



864 



GTG TGC AAT AAA ACT GAA AAT CCG CCG CTG TTC CGC GCT CTG GCG CTT 
Val Cys Asn Lys Thr Glu Asn Pro Pro Leu Phe Arg Ala Leu Ala Leu 
290 295 300 



912 



CGT CGC AAT CAG ACT CTG CTC ACT TTG CAC AGC CTG AAT ATG CTG CAT 
Arg Arg Asn Gin Thr Leu Leu Thr Leu His Ser Leu Asn Met Leu His 
305 310 315 320 



960 



TCT CGC GGT TTC CTC GCG GAA GTT TTC GGC ATC CTC GCG CGG CAT AAT 1008 
Ser Arg Gly Phe Leu Ala Glu Val Phe Gly lie Leu Ala Arg His Asn 
325 330 335 
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ATT TCG GTA GAC TTA ATC ACC ACG TCA GAA GTG AGC GTG GCA TTA ACC 1056 
lie Ser Val Asp Leu He Thr Thr Ser Glu Val Ser Val Ala Leu Thr 
340 345 350 

CTT GAT ACC ACC GGT TCA ACC TCC ACT GGC GAT ACG TTG CTG ACG CAA 1104 
Leu Asp Thr Thr Gly Ser Thr Ser Thr Gly Asp Thr Leu Leu Thr Gin 
355 360 365 

TCT CTG CTG ATG GAG CTT TCC GCA CTG TGT CGG GTG GAG GTG GAA GAA 1152 
Ser Leu Leu Met Glu Leu Ser Ala Leu Cys Arg Val Glu Val Glu Glu 
370 375 380 

GGT CTG GCG CTG GTC GCG TTG ATT GGC AAT GAC CTG TCA AAA GCC TGC 1200 
Gly Leu Ala Leu Val Ala Leu He Gly Asn Asp Leu Ser Lys Ala Cys 
385 390 395 400 

GCC GTT GGC AAA GAG GTA TTC GGC GTA CTG GAA CCG TTC AAC ATT CGC 1248 
Ala Val Gly Lys Glu Val Phe Gly Val Leu Glu Pro Phe Asn He Arg 
405 410 415 

ATG ATT TGT TAT GGC GCA TCC AGC CAT AAC CTG TGC TTC CTG GTG CCC 1296 
Met He Cys Tyr Gly Ala Ser Ser His Asn Leu Cys Phe Leu Val Pro 
420 425 430 

GGC GAA GAT GCC GAG CAG GTG GTG CAA AAA CTG CAT AGT AAT TTG.TTT 1344 
Gly Glu Asp Ala Glu Gin Val Val Gin Lys Leu His Ser Asn Leu Phe 
435 440 445 

GAG TAA 1350 
Glu * 
450 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GATCCATGGC TGAAATTGTT GTCTCCAAAT TTGGCG 36 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GTACCGCCAA ATTTGGAGAC AACAATTTCA GCCATG 36 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CCCGGGCCAT GGCTACAGGT TTAACAGCTA AGACCGGAGT AGAGCACT 48 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GATATCGAAT TCTCATTATA GAACTCCAGC TTTTTTC 37 

(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 911 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CC ATG GCT ACA GGT TTA ACA GCT AAG ACC GGA GTA GAG CAC TTC GGC 47 
Met Ala Thr Gly Leu Thr Ala Lys Thr Gly Val Glu His Phe Gly 
15 10 15 

ACC GTT GGA GTA GCA ATG GTT ACT CCA TTC ACG GAA TCC GGA GAC ATC 95 
Thr Val Gly Val Ala Met Val Thr Pro Phe Thr Glu Ser Gly Asp lie 
20 25 30. 

GAT ATC GCT GCT GGC CGC GAA GTC GCG GCT TAT TTG GTT GAT AAG GGC 143 
Asp lie Ala Ala Gly Arg Glu Val Ala Ala Tyr Leu Val Asp Lys Gly 
35 40 45 

TTG GAT TCT TTG GTT CTC GCG GGC ACC AfT GGT GAA TCC CCA ACG ACA 191 
Leu Asp Ser Leu Val Leu Ala Gly Thr Thr Gly Glu Ser Pro Thr Thr 
50 55 60 
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ACC GCC GCT GAA AAA CTA GAA CTG CTC AAG GCC GTT CGT GAG GAA GTT 239 
Thr Ala Ala Glu Lys Leu Glu Leu Leu Lys Ala Val Arg Glu Glu Val 
65 70 75 

GGG GAT CGG GCG AAG CTC ATC GCC GGT GTC GGA ACC AAC AAC ACG CGG 287 
Gly Asp Arg Ala Lys Leu lie Ala Gly Val Gly Thr Asn Asn Thr Arg 
80 85 90 95 

ACA TCT GTG GAA CTT GCG GAA GCT GCT GCT TCT GCT GGC GCA GAC GGC 335 
Thr Ser Val Glu Leu Ala Glu Ala Ala Ala Ser Ala Gly Ala Asp Gly 
100 105 110 

CTT TTA GTT GTA ACT CCT TAT TAC TCC AAG CCG AGC CAA GAG GGA TTG 383 
Leu Leu Val Val Thr Pro Tyr Tyr Ser Lys Pro Ser Gin Glu Gly Leu 
115 120 125 

CTG GCG CAC TTC GGT GCA ATT GCT GCA GCA ACA GAG GTT CCA ATT TGT 431 
Leu Ala His Phe Gly Ala He Ala Ala Ala Thr Glu Val Pro He Cys 
130 135 140 

CTC TAT GAC ATT CCT GGT CGG TCA GGT ATT CCA ATT GAG TCT GAT ACC 479 
Leu Tyr Asp He Pro Gly Arg Ser Gly He Pro He Glu Ser Asp Thr 
145 150 155 

ATG AGA CGC CTG AGT GAA TTA CCT ACG ATT TTG GCG GTC AAG GAC. GCC 527 
Met Arg Arg Leu Ser Glu Leu Pro Thr He Leu Ala Val Lys Asp Ala 
160 165 170 175 

AAG GGT GAC CTC GTT GCA GCC ACG TCA TTG ATC AAA GAA ACG GGA CTT 575 
Lys Gly Asp Leu Val Ala Ala Thr Ser Leu He Lys Glu Thr Gly Leu 
180 185 190 

GCC TGG TAT TCA GGC GAT GAC CCA CTA AAC CTT GTT TGG CTT GCT TTG 623 
Ala Trp Tyr Ser Gly Asp Asp Pro Leu Asn Leu Val Trp Leu Ala Leu 
195 200 205 
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GGC GGA TCA GGT TTC ATT TCC GTA ATT GGA CAT GCA GCC CCC ACA GCA 671 
Gly Gly Ser Gly Phe lie Ser Val He Gly His Ala Ala Pro Thr Ala 
210 215 220 

TTA CGT GAG TTG TAC ACA AGC TTC GAG GAA GGC GAC CTC GTC CGT GCG 719 
Leu Arg Glu Leu Tyr Thr Ser Phe Glu Glu Gly Asp Leu Val Arg Ala 
225 230 235 

CGG GAA ATC AAC GCC AAA CTA TCA CCG CTG GTA GCT GCC CAA GGT CGC 767 
Arg Glu He Asn Ala Lys Leu Ser Pro Leu Val Ala Ala Gin Gly Arg 
240 245 250 255 

TTG GGT GGA GTC AGC TTG GCA AAA GCT GCT CTG CGT CTG CAG GGC ATC 815 
Leu Gly Gly Val Ser Leu Ala Lys Ala Ala Leu Arg Leu Gin Gly lie 
260 265 270 

AAC GTA GGA GAT CCT CGA CTT CCA ATT ATG GCT CCA AAT GAG CAG GAA 863 
Asn Val Gly Asp Pro Arg Leu Pro He Met Ala Pro Asn Glu Gin Glu 
275 280 285 

CTT GAG GCT CTC CGA GAA GAC ATG AAA AAA GCT GGA GTT CTA TAA TGAGAATTC 918 
Leu Glu Ala Leu Arg Glu Asp Met Lys Lys Ala Gly Val Leu * 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



CTTCCCGTGA CCATGGGCCA TC 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CATGGCTGGC TTCCCCACGA GGAAGACCAA CAATGACATT ACCTCCATTG CTAGCAACGG 60 

TGGAAGAGTA CAATG 75 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CATGCATTGT ACTCTTCCAC CGTTGCTAGC AATGGAGGTA ATGTCATTGT TGGTCTTCCT 60 

CGTGGGGAAG CCAGC 75 

(2) INFORMATION FOR SEQ ID NO: 10 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CATGGCTTCC TCAATGATCT CCTCCCCAGC TGTTACCACC GTCAACCGTG CCGGTGCCGG 60 

CATGGTTGCT CCATTCACCG GCCTCAAAAG 90 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CATGCTTTTG AGGCCGGTGA ATGGAGCAAC CATGCCGGCA CCGGCACGGT TGACGGTGGT 60 

AACAGCTGGG GAGGAGATCA TTGAGGAAGC 90 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CCGGTTTGCT GTAATAGGTA CCA 23 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AGCTTGGTAC CTATTACAGC AAACCGGCAT G 31 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



GCTTCCTCAA TGATCTCCTC CCCAGCT 27 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CATTGTACTC TTCCACCGTT GCTAGCAA 28 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 16: 

GGATCCCCCG GGCTGCAGGA ATTCTACGTA CCATATAGTA AGACTTTGTA TATAAGACGT 60 

CACCTCTTAC GTGCATGGTT ATATGTGACA TGTGCAGTGA CGTTGTACCA TATAGTAAGA 120 

CTTTGTATAT aagACGTCAC CTCTTACGTG CATGGTTATA TGTGACATGT GCAGTGACGT 180 

TAACCGCACC CTCCTTCCCG TCGTTTCCCA TCTCTTCCTC CTTTAGAGCT ACCACTATAT 240 

AAATCAGGGC TCATTTTCTC GCTCCTCACA GGCTCATCAG CACCCCGGCA GTGCCACCCC 300 

GACTCCCTGC ACCTGCCATG GGTACGCTAG CCCGGGAGAT CTGACAAAGC AGCATTAGTC 360 

CGTTGATCGG TGGAAGACCA CTCGTCAGTG TTGAGTTGAA TGTTTGATCA ATAAAATACG 420 

GCAATGCTGT AAGGGTTGTT TTTTATGCCA TTGATAATAC ACTGTACTGT TCAGTTGTTG 480 

AACTCTATTT CTTAGCCATG CCAGTGCTTT TCTTATTTTG AATAACATTA CAGCAAAAAG 540 

TTGAAAGACA AAAAAANNNN NCCCCGAACA GAGTGCTTTG GGTCCCAAGC TTCTTTAGAC 600 

TGTGTTCGGC GTTCCCCCTA AATTTCTCCC CTATATCTCA CTCACTTGTC ACATCAGCGT 660 

TCTCTTTCCC CTATATCTCC ACGCTCTACA GCAGTTCCAC CTATATCAAA CCTCTATACC 720 

CCACCACAAC AATATTATAT ACTTTCATCT TCACCTAACT CATGTACCTT CCAATTTTTT 780 

TCTACTAATA ATTATTTACG TGCACAGAAA CTTAGGCAAG GGAGAGAGAG AGCGGTACC 839 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CTAGAAGCCT CGGCAACGTC AGCAACGGCG GAAGAATCCG GTG 43 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CATGCACCGG ATTCTTCCGC CGTTGCTGAC GTTGCCGAGG CTT 43 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:- 

GATCCCATGG CGCCCCTTAA GTCCACCGCC AGCCTCCCCG TCGCCCGCCG CTCCT 55 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CTAGAGGAGC GGCGGGCGAC GGGGAGGCTG GCGGTGGACT TAAGGGGCGC CATGG 55 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CATGGCGCCC ACCGTGATGA TGGCCTCGTC GGCCACCGCC GTCGCTCCGT TCCAGGGGC 59 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

TTAAGCCCCT GGAACGGAGC GACGGCGGTG GCCGACGAGG CCATCATCAC GGTGGGCGC 59 

(2) INFORMATION FOR SEQ ID NO: 23; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
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GCGCCCACCG TGATGA 16 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CACCGGATTC TTCCGC 16 
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CLAIMS 

What is claimed is : 

1. An isolated nucleic acid fragment comprising 
(a) a first nucleic acid subfragment encoding 
5 apartolkinase which is insensitive to inhibition by 
lysine and (b) a second nucleic acid subfragment 
encoding dihydrodipicolinic acid synthase which is at 
least 20-fold less sensitive to inhibition by lysine 
than plant DHDPS • 
10 2. The nucleic acid fragment of Claim 1, wherein 

either : 

(a) the first nucleic acid subfragment 
comprises a nucleotide sequence substantially homologous 
to the sequence shown in SEQ ID NO:l: encoding £. coli 

15 AKIII, said nucleic acid fragment encoding a lysine- 
insensitive variant of £. coli AKIII and further 
characterized in that at least one of the following 
conditions is met: 

(1) the amino acid at position 318 is an 
20 amino acid other than threonine , or 

(2) the amino acid at position 352 is an 
amino acid other than methionine; or 

(b) the second nucleic acid subfragment is 
derived from bacteria. 

25 3. An isolated nucleic acid fragment comprising a 

nucleic acid subfragment encoding lysine ketoglutarate 
reductase, 

4. The nucleic acid fragment of Claim 1 further 
comprising (c) a third nucleic acid subfragment encoding 

30 antisense lysine ketoglutarate reductase. 

5. The nucleic acid fragment of Claim 1 wherein 
each subfragment is operably linked to a suitable plant 
chloroplast transit sequence and each subfragment is 
operably linked to a suitable regulatory sequence to 

35 promote expression in seeds of transformed plants. 
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6. The nucleic acid fragment of Claim 4 wherein 
the first and second subfragments are each operably 
linked to a suitable plant chloroplast transit sequence 
and each subfragment is operably linked to a suitable 

5 regulatory sequence to promote expression in seeds of 
transformed plants. 

7 . An isolated nucleic acid fragment comprising 
at least one nucleotide sequence substantially 
homologous to the sequence shown in SEQ ID NO:l encoding 

10 £. AKIII, said nucleic acid fragment encoding a 

lysine-insensitive variant of £. coli AKIII and further 
characterized in that at least one of the following 
conditions is met : 

(a) the amino acid at position 318 is an 
15 amino acid other than threonine , or 

(b) the amino acid at position 352 is an 
amino acid other than methionine, 

8. An isolated nucleic acid fragment of Claim 7, 
the fragment operably linked to a plant chloroplast 

20 transit sequence and operably linked to a suitable 

regulatory sequence to promote expression in seeds of 
transformed plants. 

9. A plant comprising in its genome the nucleic 
acid fragment of Claims 5 or 6. 

25 10. A plant comprising in its genome the nucleic 

acid fragment of Claim 7. 

11. Seeds obtained from the plant of Claims 9 or 

10. 

12. A microorganism transformed with the nucleic 
30 acid fragment of Claims 1, 3, or 7. 

13. A method for increasing the threonine content 
of the seeds of plants comprising: 

(a) transforming plant cells with the nucleic 
acid fragment of Claim 8; 



WO 93/19190 



126 



PCT/US93/02480 



(b) growing fertile mature plants from the 

transformed plant cells obtained from step 

(a) under conditions suitable to obtain 
seeds; and 

5 (c) selecting from the progeny seed of step 

(b) for those seeds containing increased 
levels of threonine. 

14 . A method for increasing the lysine content of 
the seeds of plants comprising: 

10 (a) transforming plant cells with the nucleic 

acid fragment of Claims 5 or 6; 

(b) growing fertile mature plants from the 

transformed plant cells obtained from step (a) under 

conditions suitable to obtain seeds; and 
15 (c) selecting from the progeny seed of step 

(b) those seeds containing increased levels of lysine. 

15. A plant produced by the method of Claim 13 for 
increasing the threonine content of the seeds of plants, 
wherein the plant is capable of transmitting said 

20 nucleic acid fragment to a progeny plant and wherein the 
progeny plant has the ability to produce levels of free 
threonine at least two times greater than the free 
threonine levels of plants not containing the nucleic 
acid fragment. 

25 16. A plant produced by the method of Claim 14 for 

increasing the lysine content of the seeds of plants, 
wherein the plant is capable of transmitting said 
nucleic acid fragments to a progeny plant and wherein 
the progeny plant has the ability to produce levels of 

30 free lysine at least two times greater than free lysine 
levels of plants not containing the nucleic acid 
fragment . 

17. A plant comprising in its genome: 

(a) a first nucleic acid subfragment encoding 
35 AK which is insensitive to inhibition by lysine; 
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(b) a second nucleic acid subfragment 
encoding DHDPS which is at least 20-fold less sensitive 
to inhibition by lysine than plant DHDPS; and 

(c) a third nucleic acid subfragment to 
5 express antisense RNA for lysine ketoglutarate 

reductase . 
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